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The Appellants appeal the rejection of Claims 1-5 in the above-captioned patent 
application. These claims were rejected in a final Office Action dated June 22, 2005. Appellants 



mailed a Notice of Appeal September 2 1 , 2005 • n/K/goos HH6UYEH1 00000009 10063587 



I. REAL PARTY IN INTEREST 

Pursuant to 37 C.F.R. 41.37(c)(1), Appellants hereby notify the Board of Patent Appeals 
and Interferences that the real party in interest is the assignee of record for this application, 
Genentech, Inc., 1 DNA Way, South San Francisco, CA 94080. 

H- RELATED APPEALS AND INTERFERENCES 

A Notice of Appeal has been filed in the related Application Nos. 10/063,519; 
10/063,534; 10/063,560; 10/063,591, 10/063,592; 10/063,617; 10/063,661; and 10/063,713. A 
Notice of Appeal and an Appeal Brief have also been filed in the related Application Nos. 
10/063,661; 10/063,530; 10/063,540; 10/063,578; 10/063,584; 10/063,586, 10/063,616; 
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10/063,648; 10/063,652; 10/063,653; 10/063,659; 10/063,660; and 10/063,534. Appellants are 
unaware of any other related appeals or interferences. 

III. STATUS OF THE CLAIMS 

The above-captioned application was filed with Claims 1-6. Claim 1 was amended and 
Claim 6 was canceled in an Amendment and Response to Office Action dated September 22, 
2004. Claim 1 was amended in an Amendment After Final filed November 9, 2005. 
Accordingly, Claims 1-5 are the subject of this appeal. The claims at issue are attached hereto as 
Appendix A, 

IV. STATUS OF AMENDMENTS 

Claims 1-5 were finally rejected by the Examiner in a final Office Action mailed 
December 13, 2004. Appellants mailed a Response to Final Office Action on February 10, 2005, 
where no claims were amended, added or canceled. The Examiner issued an Advisory Action 
dated March 4, 2005, which did not indicate whether or not the evidence submitted in the 
Response to Final Office Action mailed February 10, 2005 would be entered for purposes of 
appeal. On April 1, 2005, Appellants filed an Amendment After Final and Request for 
Continued Examination. The final Office Action mailed June 22, 2005 indicated that the 
Amendment mailed April 1, 2005 was entered. On November 9, 2005, Appellants filed an 
Amendment After Final. Currently, there is no indication of entry of this amendment. 
Applicants submit that entry of the amendment is appropriate in accordance with 37 C.F.R. 
§1.1 16(b). 

V. SUMMARY OF THE CLAIMED SUBJECT MATTER 
Independent Claim 1 reads: 

1. An isolated antibody that specifically binds to the polypeptide of SEQ ID NO: 78. 

Various aspects of the claimed antibodies are described in the specification at, for 
example, paragraphs [0024], [0225], [0238]-[0248], and [0361]-[0405], and Figure 78. SEQ ID 
NO: 78 is disclosed in the Sequence Listing appended to the application. 
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VI. GROUNDS OF REJECTION TO BE REVIEWED ON APPEAL AND GROUPING 
OF THE CLAIMS 

The Examiner has rejected Claims 1-5 under 35 U.S.C. § 101, stating that the claimed 
invention is not supported by either a specific and substantial asserted utility or a well- 
established utility. 

The Examiner also has rejected Claims 1-5 under 35 U.S.C. § 112, first paragraph, as 
lacking enablement. The Examiner asserts that since the claimed invention is not supported by 
either a specific or substantial asserted utility or a well-established utility, one skilled in the art 
would not know how to use the claimed invention. 

The Examiner also has rejected Claims 1-5 under 35 U.S.C. § 102(b) as being anticipated 
by WO 01/16318 or WO 00/12708. The Examiner asserts that these publications disclose all 
claim limitations. The Examiner states that the cited references are properly prior art because the 
instant specification and priority documents do not meet the requirements of 35 U.S.C. § 112, 
first paragraph, and, therefore, the claims are not entitled to the benefit of priority to the earlier 
filed applications. 

Claims 1-5 can be considered as a group for purposes of the utility, enablement and 
anticipation rejections. 

VIL APPELLANTS^ ARGUMENT 
A. Summarv of the Arguments 

1. Utility Rejection 

The first issue before the Board is whether Appellants have asserted at least one "specific, 
substantial, and credible utility" for the claimed invention. See Examination Guidelines, 66 Fed. 
Reg. 1092 (2001). Appellants have asserted that the claimed antibodies to the polypeptide of 
SEQ ID NO: 78 (the PRO 1357 polypeptide) are useful as diagnostic tools for cancer, particularly 
for lung and stomach cancer. This asserted utility is specific, substantial, and credible. 

Briefly stated. Appellants' asserted utility is based in part on the disclosure in Example 

18 of the instant application that the mRNA encoding the PRO 1357 polypeptide is more highly 

expressed in normal stomach and lung tissue compared to stomach and lung tumor, respectively. 

It is well-established that there is a reasonable correlation between changes in mRNA level for a 

particular gene and a corresponding change in the level of expression of the encoded polypeptide, 
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such that increasing or decreasing the amount of mRNA for a particular gene leads to a 
corresponding increase or decrease in the amount of the encoded protein. Thus, one of skill in 
the art would be more likely than not to beheve that, like the PR01357 mRNA, the PR01357 
protein is more highly expressed in normal stomach and lung tissue compared to stomach and 
lung tumor, respectively. This differential expression of PR01357 polypeptide is useful for 
distinguishing stomach and lung tumor tissue from its normal tissue counterpart. Therefore, the 
claimed antibodies have a specific, substantial and credible utility as diagnostic tools for cancer, 
particularly stomach and lung cancer, as is explained in more detail below. 

2. Enablement Rejection 

The second issue before the Board is whether Appellants have enabled the pending 
claims such that one of skill in the art would be able to make and use the claimed invention. The 
Examiner has rejected Claims 1-5 under 35 U.S.C. §112, first paragraph, arguing that the 
claimed subject matter was not described in the specification in such a way as to enable one 
skilled in the art to use the invention. The Examiner recites the factors for determining 
enablement from In re Wands, 858 F.2d 731, 8 U.S.P.Q, 2d 1400 (Fed. Cir. 1988), relying on the 
arguments made in support of the rejection for lack of utility. 

Appellants submit that Claims 1-5 are enabled such that one of skill in the art could make 
and use the claimed antibodies without undue experimentation. The Examiner's basis for 
rejection of the claims is grounded on the lack of utility rejection. Appellants submit that since 
the claimed antibodies have utility, and since the enablement rejection is grounded on the lack of 
utility rejection, the claimed antibodies also are enabled. Notwithstanding the foregoing, 
Appellants submit that the specification fully enables how to make and use the claimed 
antibodies, and it was well within the knowledge of those skilled in the art how to make 
antibodies which are specific to a disclosed polypeptide sequence. See In re Wands, 858 F.2d 
731 (reversing the Board's decision of non-enablement and holding that as of 1980, undue 
experimentation was not required to make high-affinity monoclonal antibodies to a target 
peptide). Thus, one of skill in the art would be able to make and use the claimed antibodies 
without undue experimentation, and the Examiner has not made any arguments to the contrary. 
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3. Anticipation Rejection 

The third issue before the Board is whether Claims 1-5 are anticipated under 35 U.S.C. 
§ 102(b) by two publications cited by the Examiner. The cited publications were published in 
March 9, 2000 and March 8, 2001, respectively. Appellants claim priority to applications dating 
as early as 1998. The subject matter of the present application was disclosed in, and therefore is 
entitled to the priority date of, a priority application filed August 24, 2000. The Examiner states 
that the present application is not entitled to an earlier priority date because neither the present 
application nor any earlier application supports the claimed subject matter in accordance with 35 
U.S.C. §112. Appellants submit that because the claims are fully supported by the specification 
and the priority documents. Appellants are entitled to a filing date of no later than August 24, 
2000. In view of Appellants priority date, the cited references caimot be prior art under 35 U.S.C. 
§ 102(b). 

B. Utility Rejection 

The first issue before the Board is whether Appellants have asserted at least one "specific, 
substantial, and credible utility." See Examination Guidelines ("Utility Guidelines"), 66 Fed. 
Reg. 1092 (2001). Appellants have asserted that the claimed antibodies are usefiil as diagnostic 
tools for cancer, particularly for stomach and lung cancer. This asserted utility is specific, 
substantial, and credible. 

* 

L Claim Groupins 

For the utility rejection, Appellants group all of Claims 1-5 in a single group. 

■ 

2. Summary of Appellants ' Arsuments 

Appellants' asserted utility is based in part on the disclosure in Example 18 of the instant 
application that the mRNA encoding the PR01357 polypeptide is more highly expressed in 
normal stomach and lung tissue compared to stomach and lung tumor, respectively. It is well- 
established that there is a reasonable correlation between changes in mRNA level for a particular 
gene and a corresponding change in the level of expression of the encoded polypeptide, such that 
increasing or decreasing the amount of mRNA for a particular gene leads to a corresponding 
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increase or decrease in the amount of the encoded protein. Thus, one of skill in the art would be 
more likely than not to believe that, like the PRO 1357 mRNA, the PRO 1357 protein is more 
highly expressed in normal stomach and lung tissue compared to stomach and lung tumor, 
respectively. This differential expression of PRO 1357 polypeptide is useful for distinguishing 
stomach and lung tumor tissue from its normal tissue counterpart. Therefore, the claimed 
antibodies to the PRO 1357 polypeptide have a specific, substantial and credible utility as 
diagnostic tools for cancer, particularly stomach and lung cancer, as is explained in more detail 
below. 

3. Detailed Arsuments 

a. Utility - Lesal Standard 

A "specific utility" is defined as utility which is "specific to the subject matter claimed," 
in contrast to "a general utility that would be applicable to the broad class of the invention." See 
M.P,E.P. § 2107.01 I. For example, it is generally not enough to state that a nucleic acid is 
useful as a diagnostic tool without also identifying the condition that is to be diagnosed. 

The requirement of "substantial utility" defines a "real world" use, and derives from the 
Supreme Court's holding in Brenner v, Manson, 383 U.S. 519, 534 (1966) stating that "[t]he 
basic quid pro quo contemplated by the Constitution and the Congress for granting a patent 
monopoly is the benefit derived by the public from an invention with substantial utility." In 
explaining the "substantial utility" standard, M.P,E.P. § 2107.01 cautions, however, that Office 
personnel must be careful not to interpret the phrase "immediate benefit to the public" or similar 
formulations used in certain court decisions to mean that products or services based on the 
claimed invention must be "currently available" to the public in order to satisfy the utility 
requirement. "Rather, any reasonable use that an applicant has identified for the invention that 
can be viewed as providing a public benefit should be accepted as sufficient, at least with regard 
to defining a 'substantial' utility." M.P.E.P. § 2107.01 (emphasis added); see also Nelson v. 
Bowler 626 F.2d 853, 856, 206 U.S.P.Q. 881, 883 (CCPA 1980). 

Indeed, the Guidelines for Examination of Applications for Compliance With the Utility 
Requirement, set forth in M.P.E.P. § 2107 11(B)(1) gives the following instruction to patent 
examiners: "If the applicant has asserted that the claimed invention is usefiil for any particular 
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practical purpose . . . and the assertion would be considered credible by a person of ordinary skill 
in the art, do not impose a rejection based on lack of utility." 

Finally, in assessing the credibility of the asserted utility, the M.P.E.P. states that "to 
overcome the presumption of truth that an assertion of utility by the applicant enjoys" the PTO 
must establish that it is "more likely than not that one of ordinary skill in the art would doubt (i.e., 
'question') the truth of the statement of utility." M.P.KP, § 2107.02 III A. 

b. Utility - Burden of Proof 

It is well established that a specification which contains a disclosure of utility which 
corresponds in scope to the subject matter sought to be patented "must be taken as sufficient to 
satisfy the utility requirement of § 101 for the entire claimed subject matter unless there is reason 
for one skilled in the art to question the objective truth of the statement of utility or its scope." In 
re Longer, 503 F.2d 1380, 1391, 183 U.S.P.Q. 288, 297 (C.C.P.A. 1974). Thus "the PTO has the 
initial burden of challenging a presumptively correct assertion of utility in the disclosure." In re 
Brana, 51 F,3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995). Only after the PTO provides 
evidence showing that one of ordinary skill in the art would reasonably doubt the asserted utility 
does the burden shift to the applicant to provide rebuttal evidence sufficient to convince such a 
person of the invention's asserted utility. Id, 

c. Utility - Standard of Proof 

Compliance with 35 U.S.C. § 101 is a question of fact. Raytheon v. Roper, 724 F.2d 951, 

956, 220 U.S.P.Q. 592, 596 (Fed. Cir. 1983), In re Ziegler, 992 F.2d 1197, 1200, 26 U.S.P.Q.2d 

1600 (Fed. Cir. 1993), In re Fisher Case No. 04-1465. The evidentiary standard to be used 

throughout ex parte examination in setting forth a rejection is a preponderance of the evidence, 

or "more likely than not" standard. In re Oeiiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d 1443, 

1444 (Fed. Cir. 1992). This is stated explicitly in the M.P.E.P.: 

[T]he applicant does not have to provide evidence sufficient to establish that an 
asserted utility is true "beyond a reasonable doubt." Nor must the applicant 
provide evidence such that it establishes an asserted utility as a matter of 
statistical certainty. Instead, evidence will be sufficient if, considered as a whole, 
it leads a person of ordinary skill in the art to conclude that the asserted utility is 
more likely than not true . M.P.E.P. § 2107.02, part VII (emphasis in original, 
citations omitted). 
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The Court of Appeals for the Federal Circuit has stated that the standard for satisfying the 

utility requirement is a low one: 

The threshold of utility is not high : An invention is "useful" under section 101 if 
it is capable of providing some identifiable benefit. See Brenner v. Manson, 383 
U.S. 519, 534, 86 S.Ct 1033, 16 L.Ed.2d 69 (1966); Brooktree Corp, v. Advanced 
Micro Devices, Inc., 977 F.2d 1555, 1571 (Fed. Cir. 1992) ('To violate § 101 the 
claimed device must be totally incapable of achieving a useful result"); Fuller v. 
Bergen 120 F. 274, 275 (7th Cir. 1903) (test for utility is whether invention "is 
incapable of serving any beneficial end"). Juicy Whip, Inc. v. Orange Bang, Inc., 
185 F.3d 1364, 1366, 51 U.S.P.Q. 2d 1700 (Fed. Cir. 1999) (emphasis added). 

The low threshold for satisfying the utility requirement is reflected in the standard set by the 
Federal Circuit for invalidating a patent based on a lack of utility: "[T]he fact that an invention 
has only limited utility and is only operable in certain applications is not grounds for finding lack 
of utility. Some degree of utility is sufficient for patentability. Further, the defense of non- 
utility cannot be sustained without proof of total incapacity ." Envirotech Corp, v. Al George, 
Inc., 730 F.2d 753, 762, 221 U.S.P.Q. 473 (Fed. Cir. 1984) (emphasis added, citations omitted). 

Because the standard for satisfying the utility requirement is so low, requiring total 
incapacity for a finding of no utility, the M.P.E.P. cautions that: 

Rejections under 35 U.S.C. 101 have been rarely sustained by federal courts. 
Generally speaking, in these rare cases, the 35 U.S.C. 101 rejection was sustained 
[] because the applicant . . . asserted a utility that could only be true if it violated a 
scientific principle, such as the second law of thermodynamics, or a law of nature, 
or was wholly inconsistent with contemporary knowledge in the art. M.P.E.P. 
§ 2107.02 III B., citing In re Gazave, 379 F.2d 973, 978, 154 U.S.P.Q. 92, 96 
(C.C.P.A. 1967) (underline emphasis in original, italic emphasis added). 

rf. Appellants Asserted a Specific, Substantial and Credible Utility that is 

Sufficient to Satisfy the Utility Requirement ofS 101 

The claimed subject matter is directed to antibodies that specifically bind to the 

polypeptide of SEQ ID NO: 78. The polypeptide of SEQ ID NO: 78 (referred to as "PR01357 

polypeptide") is encoded by the polynucleotide of SEQ ID NO: 77 (also referred to as 

DNA6488 1-1602). Specification at \\ [0103]-[0104]. Appellants have asserted that the claimed 

antibodies are useful as diagnostic tools for cancer, particularly stomach and lung cancer. 

In "Example 18: Tumor Versus Normal Differential Tissue Expression Distribution" 

Appellants disclose that the mRNA encoding PRO 1357 polypeptide is more highly expressed in 

normal stomach and lung tissue compared to stomach and lung tumor, respectively. 
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Specification at ^ [0529]-[0530] and accompanying tables. As explained in paragraph [0530], 
the differential expression of the PRO 1357 mRNA was detected using the well-established 
technique of quantitative PGR amplification of cDNA libraries isolated from different human 
normal and tumor tissue samples. To ensure that equivalent amounts of nucleic acid were used 
in each reaction, the cDNA for P-actin was used as a control. 

The specification teaches that identification of the differential expression of a PRO 
polypeptide-encoding mRNA in one or more tumor tissues as compared to one or more normal 
tissues of the same tissue type "renders the molecule useful diagnostically for the determination 
of the presence or absence of tumor in a subject suspected of possessing a tumor." Specification 
at TI [0530]. The Examiner has recognized that the teachings in the specification of differential 
expression of the PRO 1357 mRNA are sufficient to establish a utility for the nucleic acid 
encoding PRO 1357 polypeptide: "[I]t is agreed that the polynucleotide of SEQ ID NO;77 has 
this specific utility." Office Action dated June 22, 2005, at page 10. 

Appellants submit that because it is well established that changes in mRNA levels lead to 

changes in the level of the encoded protein, based on the teachings of the specification, one 

would expect the PRO 1357 protein to be differentially expressed in stomach and lung tumors. 

The specification states that PRO polypeptides "may also be used diagnostically for tissue typing, 

wherein the PRO polypeptides of the present invention may be differentially expressed in one 

tissue as compared to another, preferably in a diseased tissue as compared to a normal tissue of 

the same tissue type." Specification at \ [0336]. The specification also discloses that PRO 

polypeptides and polypeptides related thereto can be used to generate anti-PRO antibodies. Id at 

\ [0364] and \ [0367]. The specification teaches that such antibodies to PRO polypeptides can 

be useful as diagnostic tools: 

[A]nti-PRO antibodies may be used in diagnostic assays for PRO [polypeptide], 
e.g., detecting its expression (and in some cases, differential expression) in 
specific cells, tissues, or serum. Various diagnostic assay techniques known in 
the art may be used, such as competitive binding assays, direct or indirect 
sandwich assays and immunoprecipitation assays conducted in either 
heterogeneous or homogeneous phases. Specification at \ [0407]. 

Taken together, the specification clearly discloses the use of the claimed antibodies as 

tools in diagnosing cancer, particularly stomach and lung cancer. This utility is substantial, as 

one of skill in the art will recognize that the diagnosis of cancer is a "real world" use; it is 

specific, as the diagnosis of stomach and lung cancer is not a utility that applies to the broad 
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class of antibodies; and it is credible, as it not a utility "that could only be true if it violated a 
scientific principle, ...or a law of nature, or [is] wholly inconsistent with contemporary 
knowledge in the art." M.P.E.P. § 2107.02 III B., citing In re Gazave, 379 F.2d 973, 978, 154 
U.S.P.Q. 92, 96 (C.C.P.A. 1967). Because Appellants' specification contains a disclosure of 
utility which corresponds in scope to the claimed subject matter, the asserted utility "must be 
taken as sufficient to satisfy the utility requirement of § 101 for the entire claimed subject matter 
unless there is reason for one skilled in the art to question the objective truth of the statement of 
utility or its scope." In re Langer, 503 F.2d 1380, 1391, 183 U.S.P.Q. 288, 297 (C.C.P.A, 1974). 
Therefore, the burden of establishing a prima facie case of lack of utility rests with the PTO. See, 
In re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995) ("the PTO has the initial 
burden of challenging a presumptively correct assertion of utility in the disclosure"). 

4. The Data in Example 18 are Data Re£ardin2 Differential mRNA Levels, not 
Gene Amplification 

Appellants begin by clarifying that the data concerning the differential expression of the 
PRO 1357 gene presented in Example 18 relate to gene expression, not gene amplification . The 
description of Example 18 makes clear that the results were obtained by quantitative PCR 
amplification of cDNA libraries. It is well known in the art that cDNA libraries are made from 
mRNA, and reflect the level of mRNA for a particular gene in the source tissue. Thus, Example 
18 is reporting a measure of the expression of the PRO 1357 gene, i.e., mRNA levels, not its 
amplification, i.e., the number of copies of the PRO 1357 gene in the genome. 

For this reason, the relationship between gene amplification and the level of protein 
expression is not relevant to the instant application. The distinction between gene amplification 
and protein expression has been previously explained to the Examiner. Notwithstanding 
Appellants' efforts, the Examiner continues to cite Konopka (Proc. Natl. Acad. Sci. USA, (1986) 
83:4049-52) as providing "evidence showing lack of correlation of between gene amplification 
and increased polypeptide levels." Office Action dated June 22, 2005, at page 5. 

Further, the Examiner uses this evidence as basis for the utility rejection: 

Given how small the unknown amount that DNA copy number of PRO 1357 
decreased in tumors, and the evidence provided by Haynes et al., Hu et al. 
(discussed above) and Konopka et al., one skilled in the art would not have 
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assumed that a small decrease in gene copy number would correlate with 
significantly increased mRNA or polypeptide levels . The level of decrease of the 
encoding nucleic acid is not disclosed. One skilled in the art would have to do 
further research to determine whether or not the PRO 1357 polypeptide levels 
decreased significantly in the tumor samples. Such further research requirements 
make it clear that the asserted utility is not yet in currently available form, i.e., it 
is not substantial. Office Action dated June 22, 2005, at page 5 (emphasis added). 

The correlation between gene copy number and mRNA/polypeptide levels is completely 
irrelevant to the instant application. Appellants have provided reliable evidence of differential 
PR01357 mRNA levels in certain tumors by examining cDNA libraries, not genomic DNA. 
Whether this differential mRNA expression is due to changes in gene copy number, transcription 
rates, a combination of these, or some other known or unknown cellular mechanism is simply not 
relevant to Appellants' asserted utility. Regardless of the cause, the differential expression of 
PRO 1357 mRNA and the resulting differential expression of PRO 1357 protein can be used as a 
molecular marker of lung or stomach cancer to assist in the diagnosis of these diseases. 

The issues before the Board will be greatly simplified if an understanding can be reached 
that the data in Example 18 reflect the level of mRNA for PRO 1357 expressed, /.e., the level of 
PRO 1357 gene expression, and not the number of copies of the PRO 1357 gene present in the 
genome, i.e., gene amplification. Once this is established, it is clear that the gene amplification 
data of the Konopka reference are not relevant to Appellants' asserted utility, and Appellants' 
references and declarations regarding the relationship between mRNA levels and protein levels 
are directly relevant. 

5. The Examiner's Arsuments 

The most recent Office Action, dated June 22, 2005, states that the claims are rejected as 

lacking utility "for the reasons set forth in the previous Office action." Office Action dated Jime 

22, 2005, at page 2. However, not all of the Examiner's reasoning for rejecting the claims as 

lacking utility provided in previous Office Actions are consistent with the reasoning of the Office 

Action dated June 22, 2005. For example, in the Office Action dated June 22, 2005, the 

Examiner states, "[I]t is agreed that the polynucleotide of SEQ ID NO:77 has this specific 

utility." Office Action dated June 22, 2005, at page 10. Whereas, the Examiner had previously 

stated "it should be noted that with the Office maintaining that the instant specification does not 

support utility for the polynucleotide as a diagnostic tool, a diagnostic use for the protein or 
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antibody is likewise not supported, regardless of whether their levels correspond." Advisory 
Action dated March 4, 2005, at page 2. In view of the foregoing, the Examiner's arguments 
discussed herein are those provided in, or consistent with, the Office Action dated June 22, 2005. 

The Examiner has explained that the basis of the utility rejection "is the insufficiency of 
disclosure to support a specific and substantial or well established utility" because "there is 
critical information lacking which includes: whether differences in nucleic acid expression of 
PRO 1357 were significant, under what conditions differences could be detected, and what levels 
(relative or absolute) were detected in tumor and normal control, the skilled artisan cannot use 
(whether in vivo or in vitro) the claimed invention." Office Action dated June 22, 2005 at page 4. 

In response to Appellants arguments, the Examiner has asserted that "even if the 
encoding polynucleotide has utility, on [sic] cannot on that basis alone support a utility for the 
encoded protein because the prior art provides sufficient support to make a correlation between 
mRNA and encoded protein level unpredictable." Office Action dated June 22, 2005 at page 3. 
The Examiner further stated, "While one can find prior art that supports a 'significant 

■ 

probability' that mRNA and protein levels will correlate, there is influential prior art of record 
that requires the Examiner maintain that as a whole, the prior art does not provide a reasonable 

■ 

expectation that expression of the nucleic acid of SEQ ID NO:77 positively correlates with the 
expression of the protein of SEQ ID NO:78." Office Action dated June 22, 2005 at pages 4-5. 
As evidentiary support, the Examiner has cited Hu et al (J. Proteome Res., (2003) 2(4):405-12) 
as teaching that, "for genes displaying a 5-fold change or less in tumors compared to normal, 
there was no evidence of a correlation between altered gene expression and a known role in the 
disease." Office Action dated June 22, 2005 at page 5. The Examiner also cited Haynes et al 
(Electrophoresis, (1998) 19(1 1):1862-71) and Konopka et al (Proc. Natl. Acad. Sci. USA, 
(1986) 83:4049-52) as supporting the assertion that there is "no strong correlation between 
protein and transcript levels," and there is a "lack of correlation between gene amplification and 
increased polypeptide levels." Office Action dated June 22, 2005 at page 5. 

6. The Examiner has not Established a Prima Facie case that Claims 1-5 Lack 
Utility 

The Examiner has not met the burden of "provid[ing] evidence showing that one of 
ordinary skill in the art would reasonably doubt the asserted utility." In re Brana, 51 F.3d 1560, 
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1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995). The Examiner has the burden of presenting 
"countervailing facts and reasoning sufficient to establish that a person of ordinary skill would 
not believe the Appellant's assertion of utility." M.P.E.P, at § 2107.02 III.A., citing in re Brana, 
51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995) ("Only after the PTO provides 
evidence showing that one of ordinary skill in the art would reasonably doubt the asserted utility 
does the burden shift to the Appellant to provide rebuttal evidence") (emphasis added). 

As discussed in further detail below, the Examiner has asserted that the specification does 
not disclose sufficient experimental details to support a utility for the claimed antibodies. In an 
attempt to rebut Appellants' traversal of the utility rejection, the Examiner cited three references. 
Appellants submit that the Examiner has erroneously placed the initial burden on Appellants to 
substantiate their asserted utility, and further submit that the Examiner's rebuttal arguments and 
evidence do not support the utility rejection of the claimed antibodies. 

a. The Examiner basis for the Utility Rejection is Improper 

The Examiner has improperly placed the burden on Appellants to initially establish the 
utility of the claimed antibodies. Without relying on any facts or reasoning, the Examiner has 
held that the claims lack utility because the specification provides insufficient experimental 
details. For example, the Examiner states: 

The issue in this application is the insufficiency of disclosure to support a specific 
and substantial or well established utility or to allow the skilled artisan to use the 
claimed invention without undue experimentation. Because as previously 
discussed there is critical information lacking which includes: whether differences 
in nucleic acid expression of PRO 1357 were significant, under what conditions 
differences could be detected, and what levels (relative or absolute) were detected 
in tumor and normal control, the skilled artisan cannot use (whether in vivo or in 
vitro) the claimed invention. It is possible that with the lacking information in 
hand, the skilled artisan could make a reasonable correlation between at least 
nucleic acid expression data and diagnostic utility. Advisory Action dated 
March 4, 2005, at page 2. 

Thus, the Examiner holds that the claimed antibodies cannot have utility unless the 
specification discloses various experimental details such as assay conditions and experimentally 
detected expression levels. This standard is inconsistent with PTO policy and the guidelines 
provided by the courts: "In most cases, an applicant's assertion of utility creates a presumption of 
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utility that will be sufficient to satisfy the utility requirement of 35 U.S.C. 101." M.P.KP. 
§2107.02 III. A. "[T]he PTO has the initial burden of challenging a presumptively correct 
assertion of utility in the disclosure. Only after the PTO provides evidence showing that one of 
ordinary skill in the art would reasonably doubt the asserted utility does the burden shift to the 
applicant to provide rebuttal evidence sufficient to convince such a person of the invention's 
asserted utility." In re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995) 
(citations omitted). 

As provided herein, Appellants have asserted a utility for the claimed antibodies. For 
example, the specification teaches identification of the differential mRNA expression, diagnostic 
use of PRO polypeptides, use of PRO polypeptides to generate anti-PRO antibodies, and use of 
the anti-PRO antibodies (see Specification at, e.g., \\ [0364], [0366], [0367], [0407] and [0530]). 
Accordingly, the PTO has the initial burden of challenging Appellants' asserted utility; 
Appellants do not have the burden of disclosing details such as assay conditions and 
experimentally detected expression levels in order to initially establish utility of the claimed 
subject matter. 

Appellants previously traversed the Examiner's rejection, arguing that the Examiner was 
improperly requiring that the specification disclose various experimental details in order to 
initially establish the utility of the claimed subject matter. In response, the Examiner stated: 

The Office is not requiring anything. The specification has failings which the 
Examiner pointed out. Office Action dated June 22, 2005, at page 7. 

This statement is inconsistent with the Examiner's rejection because the Examiner is 
clearly requiring the specification to disclose "critical information" "which includes: whether 
differences in nucleic acid expression of PR01357 were significant, under what conditions 
differences could be detected, and what levels (relative or absolute) were detected in tumor and 
normal control" in order to establish utility of the claimed antibodies. See Advisory Action dated 
March 4, 2005, at page 2. PTO policy and the guidelines provided by the courts make it clear 
that applicants need only provide an assertion of utility, which is presumptively correct. In the 
present case. Appellants have asserted a utility for the claimed antibodies. Thus, the burden is on 
the Examiner to provide "evidence showing that one of ordinary skill in the art would reasonably 
doubt the asserted utility." The Examiner does not rely on any evidence or reasoning to 
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conclude that the claimed antibodies lack utility. Instead, the Examiner asserts that "the 
specification has failings," and erroneously places the burden on Appellants' specification to 
disclose sufficient information in order to initially establish the utility of the claimed antibodies. 
Because such a basis for a utility rejection is inconsistent with PTO policy and the guidelines 
provided by the courts, this rejection should be withdrawn. 

b. The Examiner's Evidence is Insufficient to Establish the Claims as 
Prima Facie lackins Utility 
Notwithstanding the impropriety of the Examiner's utility rejection, Appellants have 
responded to the Examiner's rejection by providing evidence in support of utility of the claimed 
antibodies (discussed below). In response to Appellants evidence and supporting arguments, the 
Examiner has cited three references in rebuttal. The three references cited by the Examiner, Hu 
et al, Haynes et aL, and Konopka et al, do not support the Examiner's position, for the reasons 
provided below. Instead, the evidence presented by the Examiner supports Appellants' asserted 
utility. Appellants submit that there is no evidence in the record to support the Examiner's 
argument that Appellants' asserted utility is not substantial, and the invention is incomplete. 
Absent some significant evidence to support these assertions, the Examiner cannot establish a 
prima facie showing that one of skill in the art would reasonably doubt the asserted utility. 
Accordingly, all claims are improperly rejected as lacking utility. 

i. Hu etal. 

The Examiner has cited Hu et al (J. Proteome Res., (2003) 2(4):405-12) in asserting that 
for genes displaying a 5-fold change or less in tumors compared to normal, there was no 
evidence of a correlation between altered gene expression and a known role in the disease. 

In Hu, the researchers used an automated literature-mining tool to summarize and 
estimate the relative strengths of all human gene-disease relationships published on Medline. 
They then generated a microarray expression dataset comparing breast cancer and normal breast 
tissue. Using their data-mining tool, they looked for a correlation between the strength of the 
literature association between the gene and breast cancer, and the magnitude of the difference in 
expression level. They report that for genes displaying a .5-fold change or less in tumors 
compared to normal, there was no evidence of a correlation between altered gene expression and 
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a known role in the disease. See Hu at 411. However, among genes with a 10-fold or more 
change in expression level, there was a strong correlation between expression level and a 
published role in the disease. Id. at 412. Importantly, Hu reports that the observed correlation 
was only found among estrogen receptor-positive tumors, not ER-negative tumors. Id. 

The general findings of Hu are not surprising ~ one would expect that genes with the 
greatest change in expression in a disease would be the first targets of research, and therefore 
have the strongest known relationship to the disease as measured by the number of publications 
reporting a connection with the disease. The correlation reported in Hu only indicates that the 
greater the change in expression level, the more likely it is that there is a published or known role 
for the gene in the disease, as found by their automated literature-mining software. Thus, Hu's 
results merely reflect a bias in the literature toward studying the most prominent targets, and 
reflect nothing regarding the ability of a gene that is 2-fold or more differentially expressed in 
tumors to serve as a disease marker. 

Hu acknowledges the shortcomings of this method in explaining the disparity in Hu's 
findings for ER-negative versus ER-positive tumors: Hu attributes the "bias in the literature" 
toward the more prevalent ER-positive tumors as the explanation for the lack of any correlation 

w 

between number of publications and gene expression levels in less-prevalent (and, therefore, less 
studied) ER-negative tumors. Id, Because of this intrinsic bias, Hu's methodology is unlikely to 
ever note a correlation of a disease with less differentially-expressed genes and their 
corresponding proteins, regardless of whether or not an actual relationship between the disease 
and less differentially-expressed genes exists. Accordingly, Hu's methodology yields results that 
provide little or no information regarding biological significance of genes with less than 5-fold 
expression change in disease. Nowhere in Hu does it say that a lack of correlation in their study 
means that genes with a less than five-fold change in level of expression in cancer cannot serve 
as a molecular marker of cancer. 

Appellants submit that a lack of known role for the PRO 1357 gene in cancer does not 
prevent its use as a diagnostic tool for cancer. There is a difference between use of a gene for 
distinguishing between tumor and normal tissue on the one hand, and establishing a role for the 
gene in cancer on the other. Genes with lower levels of change in expression may or may not be 
the most important genes in causing the disease, but the genes can still show a consistent and 
measurable change in expression. While such genes may or may not be good targets for fiirther 
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research, they can nonetheless be used as diagnostic tools. Thus, Hu does not refute Appellants' 

assertion that the PRO 1357 gene can be used as a cancer diagnostic tool because it is 

differentially expressed in certain tumors. 

In response to Appellants' previous arguments, the PTO has stated that the findings of Hu 

et al suggest that there is a correlation between expression level and activity. The PTO quotes 

the portion of Hu stating: 

It is not uncommon to see expression changes in microarray experiments as small 
as 2-fold reported in the literature. Even when these expression changes are 
statistically significant, it is not always clear if they are biologically meaningful. 
Hu at page 411, right column. 

As the PTO has asserted, Hu studied differential gene expression and a known role in a 
disease. See, e.g., Office Action dated June 22, 2005, at page 5. Thus, Hu's analysis of 
differential expression of a gene whose role in a disease is "biologically meaningful" to the 
disease is completely different from Appellants' asserted differential expression of a gene for 
diagnostic purposes. Even if a gene does not have a biologically meaningful role in a disease, 
this does not indicate that the gene does not show a consistent and measurable change in 
expression in the cancer. Whether or not a differentially expressed gene has a biologically 
meaningful role in a disease does not change the fact that differential expression of a gene and 
encoded polypeptide can be used in diagnosis of a disease. The lack of a biologically 
meaningful role of PRO 1357 in cancer, for example, is irrelevant to whether its differential 
expression can be used to assist in diagnosis of cancer - one does not need to know why 
PRO 1357 is differentially expressed, or the biological meaning of the differential expression, in 
order to exploit the differential expression to distinguish tumor from normal tissue. 

The PTO's own written policies recognize that the utility of a nucleic acid does not 
depend on the function of the encoded gene product. The Utility Examination Guidelines 
published on January 5, 2001 state: "In addition, the utility of a claimed DNA does not 
necessarily depend on the function of the encoded gene product. A claimed DNA may have a 
specific and substantial utility because, e,g, it hybridizes near a disease-associated gene or it has 
a gene regulating activity." (Federal Register, Volume 66, page 1095, Comment 14). Similarly, 
here the disclosed nucleic acids, as well as the encoded polypeptides and related antibodies, are 
useful for determining whether an individual has cancer regardless of whether or not they are the 
cause of the cancer. 
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Any requirement of a known role for PRO 1357 in cancer for utility is also inconsistent 
with the analogous standard for therapeutic utility of a compound where "the mere identification 
of a pharmacological activity of a compound that is relevant to an asserted pharmacological use 
provides an 'immediate benefit to the public' and thus satisfies the utility requirement." M.P.E.P. 
§ 2701.01 (emphasis original). Here, the mere identification of altered expression in tumors is 
relevant to diagnosis of tumors, and, therefore, provides an immediate benefit to the public. 

The Examiner has acknowledged that "[i]t is correct that the role of a gene need not be 
known." Office Action dated June 22, 2005, at page 10. However, the Examiner has cited Hu in 
asserting that "for genes displaying a 5-fold change or less in tumors compared to normal, there 
was no evidence of a correlation between altered gene expression and a known role in the 

■ 

disease." See, e.g., Office Action dated June 22, 2005, at page 5. Thus, utility that is not based 
on knowing the role of a gene in a disease is accepted by the Examiner as a permissible utility 
that may be asserted, and such an asserted utility is unaffected by the teachings of Hu. 
Accordingly, Appellants' asserted diagnostic utility, which is not based on knowing the role of 
the gene in a disease, is accepted by the Examiner as a permissible utility that may be asserted, 
and is unaffected by the teachings of Hu. 

Further, as noted above, the utility of the nucleic acid encoding the PRO 1357 polypeptide 
is acknowledged by the Examiner. Thus, the Examiner does not take the position that Hu 
demonstrates that one skilled in the art would doubt the utility of the nucleic acid encoding the 
PRO 1357 polypeptide. Further, the Examiner does not assert that the teachings of Hu indicate 
that differential protein expression is less reliable than differential nucleic acid expression. Nor 
does the Examiner assert that Hu teaches that changes in mRNA levels do not typically 
correspond with changes in levels of the encoded polypeptide. In short, the Examiner provides 
no explanation for why Hu is insufficient to demonstrate that the nucleic acid encoding the 
PRO 1357 polypeptide lacks utility, but is sufficient to demonstrate that the PRO 1357 
polypeptide lacks utility. As such, the Examiner provides no reason to regard Hu as evidence 
that challenges the utility of the claimed antibodies. 

In summary, Appellants submit that the teachings of Hu are directed to a known role of 
differentially expressed genes in disease. The Examiner's characterization of Hu is in accord 
with this. Appellants submit that the claimed antibodies need not have a known role in a disease 
in order to have utility. The Examiner agrees with this. Appellants' teachings in the 
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specification of differential expression of the nucleic acid encoding the PRO 1357 polypeptide 
demonstrate a utility of the PRO 1357 nucleic acids over the teachings of Hu. The Examiner 

a 

agrees with this. Appellants submit that Hu provides no teachings that distinguish differential 
expression of nucleic acids from differential expression of polypeptides. The Examiner does not 
assert that Hu contains such teachings. Thus, Appellants submit that (a) Hu's teachings are not 
relevant to Appellants' asserted diagnostic utility, (b) Hu's teachings are insufficient to question 
the utility of PRO 1357 nucleic acids, and (c) Hu's teachings are not any more relevant to 
polypeptides than to nucleic acids. None of the Examiner's assertions are inconsistent with this. 
Accordingly, Appellants submit that Hu does not discredit Appellants' asserted utility, and the 
Examiner provides no reasoning to think otherwise. 

ii. Haynes etah 

The Examiner has cited Haynes et al (Electrophoresis, (1998) 19(11): 1862-71), as 
teaching that there is "no strong correlation between protein and transcript levels." Office Action 
dated June 22, 2005, at page 5. The Examiner also referred to Haynes in asserting that 
"polypeptide levels cannot be accurately predicted from mRNA levels." Office Action dated 
June 22, 2005, at page 5. For the reasons discussed below, Haynes is not contrary to Appellants' 
assertion that generally speaking, changes in mRNA levels lead to corresponding changes in the 
level of polypeptide. 

Haynes studied whether there is a correlation between the level of mRNA expression and 
the level of protein expression for 80 selected genes from yeast. The genes were selected 
because they constituted a relatively homogeneous group with respect to predicted half-life and 
expression level of the protein products. See Haynes at page 1863. Haynes did not examine 
whether a change in transcript level for a particular gene led to a change in the level of 
expression of the corresponding protein. Instead, Haynes determined whether the steady-state 
transcript level correlated with the steady-state level of the corresponding protein based on an 
analysis of 80 different genes. 

Haynes reported to have "found a general trend but no strong correlation between protein 
and transcript levels." Id However, a cursory inspection of Fig. 1 shows a clear correlation 
between the mRNA levels and protein levels measured. This correlation is confirmed by an 
inspection of the full-length research paper from which the data in Fig. 1 were derived, (Gygi et 
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aL, Molecular and Cellular Biology, Mar. 1999, 1720-1730), submitted as Exhibit 5 with 
Appellants' Amendment and Response to Final Office Action mailed February 10, 2005. Gygi 
states that "there was a general trend of increased protein levels resulting from increased mRNA 
levels," with a correlation coefficient of 0.935, indicating a strong correlation. Gygi at page 
1726. Moreover, Gygi also states that the correlation is especially strong for highly expressed 
mRNAs. Id Thus, it is not clear that Haynes even supports the Examiner's position,. as Haynes 
did report a general trend, and Gygi reports a strong correlation between increasing mRNA 
levels and increasing protein levels. 

The Examiner has focused on the portion of Haynes where the authors reported that for 
some of the studied genes with equivalent mRNA levels, there were differences in corresponding 
protein expression, including some that varied by more them 50-fold. Similarly, Haynes reports 
that different proteins with similar expression levels were maintained by transcript levels that 
varied by as much as 40-fold. Office Action dated June 22, 2005, at page 5. Thus, Haynes 
showed that for one type of yeast, similar mRNA levels for different genes did not universally 
result in equivalent protein levels for the different gene products, and similar protein levels for 
different gene products did not universally result from equivalent mRNA levels for the different 
genes. These results are expected, since there are many factors that determine translation 
efficiency for a given transcript, or the half-life of the encoded protein. Not surprisingly, based 
on these results, Haynes concluded that protein levels cannot always be accurately predicted 
from the level of the corresponding mRNA transcript when looking at the level of transcripts 
across different genes . 

Importantly, Haynes did not say that for a single gene, the level of mRNA transcript is 
not positively correlated with the level of protein expression. Appellants have asserted that 
increasing or decreasing the level of mRNA for the same gene leads to an increase or decrease 
for the corresponding protein. Haynes did not study this issue and says absolutely nothing about 
it. One cannot look at the level of mRNA across several different genes to investigate whether a 
change in the level of mRNA a particular gene leads to a change in the level of protein for that 
gene. Therefore, Haynes is not inconsistent with or contradictory to the utility of the instant 
claims, and offers no support for the Examiner's rejection of Appellants' asserted utility. 
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In response to Appellants' previous arguments, the Examiner pointed to specific 

teachings in Haynes et al and Gygi et al regarding observed expression levels. For example, 

the Examiner quoted as Gygi stating: 

The observed level of correlation between mRNA and protein expression levels 
suggest the importance of posttranslational mechanisms controlling gene 
expression. Such mechanisms include translational control and control of 
protein half-life.... Since these mechanisms are also active in higher eukaryotic 
cells, we speculate that there is no predictive correlation betvyeen steady-state 
levels of mRNA and those of protein in mammalian cells . Office Action dated 
June 22, 2005, at page 8, citing Gygi et al at page 1727 (emphasis added). 

The above-underlined sentence illustrates how the Examiner has misapplied both Haynes 
and Gygi. Appellants have asserted that increasing or decreasing {i.e., changing) the level of 
mRNA for a particular gene leads to a respective increase or decrease (/.e., change) for the 
corresponding protein. Gygi states that the steady-state {i.e., constant) levels of various mRNAs 
do not correlate with the steady-state {i.e., constant) levels of the corresponding polypeptides. 
Appellants' assertions could be illustrated by a two-dimensional plot showing various 
polypeptide concentrations of a particular gene as a function of various mRNA concentrations of 
that same gene. Gygi's and Haynes' assertions are illustrated by a two-dimensional plot showing 
a single polypeptide concentration (the steady-state concentration) for various genes as a 
function of a single mRNA concentration (the steady-state concentration) of those various genes. 
Haynes' and Gygi's plots explored whether or not there was a common relationship between 
steady-state {i.e., constant) mRNA levels and steady-state {i.e., constant) polypeptide levels for 
various genes. In contrast, Appellants assert that generally an increase or decrease {i.e., change) 
in mRNA levels for the same gene leads to a respective increase or decrease {i.e., change ) for the 
corresponding polypeptide. Neither Haynes nor Gygi reported anything regarding changes in 
mRNA levels or changes in polypeptide levels for the same gene. Accordingly, the results of 
Haynes or Gygi are not relevant to Appellants' assertions regarding the relationship between 
changes in mRNA levels and the corresponding polypeptide levels. As such, Haynes and Gygi 
cannot support the Examiner's holding of a lack of utility for the claimed antibodies. 



HI. 
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The Examiner has cited a third reference, by Konopka et al (Proc. Natl. Acad. Sci. USA, 
(1986) 83:4049-52), as supporting the assertion that there is a "lack of correlation between gene 
amplification and increased polypeptide levels." Office Action dated June 22, 2005, at page 5. 

As discussed above, the Examiner has confused the relationship between an increase in 
copy number of a gene and the level of mRNA on the one hand, with the relationship between 
mRNA expression and levels of the corresponding protein on the other. In particular, the 
Examiner cites the statement in Konopka that "[p]rotein expression is not related to amplification 
of the abl gene, but to variation in the level of the bcr-abl mRNA production from a single Phi 
template." Office Action dated June 22, 2005, at page 5, citing Konopka at Abstract. The results 
presented in Konopka actually present evidence in support of Appellants' position that there is a 
general understanding in the art that changes in levels of mRNA correlate with changes in levels 
of the corresponding polypeptides. Konopka analyzed the expression patterns of a gene 
associated with certain cancers. The authors show a wide variation in the levels of the protein in 
various cell types, and find that this variation can be attributed to the levels of the corresponding 
mRNA in each cell type. SeCy Konopka, at page 4050. Konopka thus concludes, "these 
combined data suggest that differential bcr-abl mRNA expression from a single gene template is 
responsible for the variable levels of P210 ' [the protein of interest] detected." Id., at page 
4051. Thus, far fi*om supporting the PTO's assertion that it is not the norm that increased 
transcription leads to increased levels of the corresponding polypeptide, Konopka supports the 
position asserted by Appellants - that changes in the level of mRNA, more often than not, 
correspond to changes in the level of the corresponding polypeptide. 

The Examiner agrees with this position. The Examiner states: 

Applicants argue (p. 15) that Konopka et al. looked at gene amplification and 
protein level for bcr-able [sic], finding a lack of correlation, though the mRNA 
and protein level did correlate. This argument is conceded, however, this 
exception looking at a single protein and gene does not support the opposition 
position of strong predictability of correlation between mRNA and protein level. 
Other references have been reviewed here and in previous Office actions which 
support the unpredictability by examining not just one, but a hundred or hundreds 
of mRNAs and corresponding proteins. Office Action dated June 22, 2005, at 
page 9. 

Thus, the Examiner agrees that Konopka is evidence in support of Appellants asserted utility. 

However, instead of considering the teachings of Konopka in evaluating all of the evidence on 

-22- 



Appl. No. : 10/063,587 

Filed : May 3, 2002 



record, the Examiner dismisses her own reference as an "exception" that does not undermine the 
assertion that changes in levels of an mRNA do not typically correspond to changes in levels of 
the encoded protein. The Examiner justifies dismissing Konopka based on "[o]ther references," 
Le., Hu and Haynes. The Examiner states that these "[o]ther references" examined "not just one, 
but a hundred or hundreds of mRNAs and corresponding proteins." Hu is completely silent 
regarding any relationship between mRNA levels and polypeptide levels. As discussed above, 
Haynes did not study the relationship between changes in mRNA levels and resultant 
polypeptide levels for a single gene. Accordingly, Konopka contains relevance to Appellants 
asserted utility that is not addressed in either Haynes or Hu. As such, Konopka should not be 
dismissed as an "exception" to the teachings of these "[o]ther references." 

The teachings of Konopka are more relevant to Appellants' asserted utility than the 
teachings of Hu or Haynes, and support Appellants asserted utility. It is the duty of the 
Examiner to fully consider the totality of the record. See M.P.E.P. § 2107 II.D. By dismissing 
Konopka as an "exception," the Examiner fails in this duty. 

iv. Summary 

The Examiner has cited three references, Hu et al , Haynes et al , and Konopka et aL 
Hu reports that genes with high levels of differential expression in disease more 
frequently are characterized by a known, published role in the disease. Hu is silent regarding 
diagnostic properties of differentially expressed genes and, therefore, has no bearing on 
Appellants' asserted utility. Furthermore, the Examiner acknowledges the utility of the 
differentially expressed nucleic acid encoding the PR01357 polypeptide, and the Examiner 
provides no reason why Hu demonstrates that the PRO 1357 polypeptide, but not the PRO 1357 
nucleic acid, lacks utility. Thus, the factual teachings of Hu are not relevant to Appellants 
asserted utility, and the Examiner provides no basis for citation of Hu as demonstrating a lack of 
utility of the claimed antibodies, where the Examiner agrees that Hu does not demonstrate a lack 
of utility for the encoding nucleic acid. 

Haynes (and Gygi, cited therein) investigated the relationship between steady state levels 
of mRNA for various genes and steady state levels of polypeptides of these various genes. 
Neither Haynes nor Gygi reported any data regarding the relationship between changes in 
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mRNA levels for a particular gene and the resultant levels of the encoded polypeptide. 
Accordingly, the factual teachings of Haynes are not relevant to Appellants asserted utility. 

Konopka teaches that changes in protein expression levels are related to changes in 
mRNA levels, which supports Appellants' asserted utility. The Examiner agrees with 
Appellants' characterization of Konopka, but nevertheless dismisses her own evidence as an 
"exception." 

In sum, two of the three cited references contain no factual teachings that are relevant to 
Appellants asserted utility. The third cited reference supports Appellants asserted utility. 
Despite the Examiner ignoring Konopka' s teachings in support of Appellants assertions, the 
cited references do not provide any factual evidence demonstrating that one of ordinary skill in 
the art would reasonably doubt Appellants' asserted utility. Moreover, the totality of the 
evidence cited by the Examiner supports Appellants asserted utility. 

The Examiner has stated: 

While one can find prior art that supports a 'significant probability' that mRNA 
and protein levels will correlate, there is influential prior art of record that 
requires the Examiner maintain that as a whole, the prior art does not provide a 
reasonable expectation that expression of the nucleic acid of SEQ ID NO:77 
positively correlates with the expression of the protein of SEQ ID NO: 78. Office 
Action dated June 22, 2005, at pages 4-5. 

As Appellants have explained, the "influential art" of record (Hu, Haynes and Konopka) 
is either silent regarding any relationship between mRNA and polypeptide levels (Hu), silent 
regarding the relationship between changes in mRNA levels and resultant polypeptide levels 
(Haynes), or supports Appellants assertion that changes in mRNA levels generally lead to similar 
changes in the levels of the corresponding polypeptide (Konopka). Accordingly, there is no 
factual basis in the record to conclude that this "influential art" "requires the Examiner maintain 
that as a whole, the prior art does not provide a reasonable expectation that expression of the 
nucleic acid of SEQ ID NO:77 positively correlates with the expression of the protein of SEQ ID 
NO:78." In view of the above, Appellants submit that the evidence cited by the Examiner does 
not demonstrate that one of ordinary skill in the art would reasonably doubt Appellants' asserted 
utility, and therefore, cannot support a prima facie rejection of the claims as lacking utility. 
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7. Even if the Examiner established a Prima Facie case. Appellants have provided 
Sufficient Rebuttal Evidence of Utility 

*'Only after the PTO provides evidence showing that one of ordinary skill in the art would 

reasonably doubt the asserted utility does the burden shift to the applicant to provide rebuttal 

evidence." In re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995). The rebuttal 

evidence must be sufficient such that when it is considered as a whole, it is more likely than not 

that the asserted utility is true. See In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d 1443, 

1444 (Fed. Cir. 1992) (stating that the evidentiary standard to be used throughout ex parte 

examination in setting forth a rejection is a preponderance of the evidence, or "more likely than 

not" standard). The M.P.E.P. summarizes the standard of proof required: 

[T]he applicant does not have to provide evidence sufficient to establish that an 
asserted utility is true "beyond a reasonable doubt." Nor must the applicant 
provide evidence such that it establishes an asserted utility as a matter of 
statistical certainty. Instead, evidence will be sufficient if, considered as a whole, 
it leads a person of ordinary skill in the art to conclude that the asserted utility is 
more likely than not true . M.P,E.P. § 2107.02, part VII (emphasis in original, 
citations omitted). 

Appellants remind the Board that the Federal Circuit has stated that the standard for satisfying 
the utility requirement is a low one: "The threshold of utility is not high: An invention is 'usefiil' 
under section 101 if it is capable of providing some identifiable benefit." Juicy Whip, Inc, v. 
Orange Bang, Inc., 185 F.3d 1364, 1366, 51 U.S.P.Q. 2d 1700 (Fed. Cir. 1999), 

Even if the Examiner has satisfied her burden of presenting a prima facie case of lack of 
utility. Appellants have supplied more than enough rebuttal evidence, such that when considered 
as a whole, one of skill in the art would conclude that the asserted utility is more likely than not 
true. As discussed in detail below. Appellants have provided sufficient evidence that the gene 

« 

encoding the PRO 1357 polypeptide is differentially expressed in certain cancers and can 
therefore be used as a diagnostic tool. In addition. Appellants have shown that it is well 
established in the art that there is a reasonable correlation between changes in mRNA level and 
changes in the corresponding polypeptide level such that one of skill in the art would believe that 
the PRO 1357 polypeptide is also differentially expressed in certain cancers. Therefore, 
considering the evidence as a whole, one of skill in the art would believe that it is more likely 
than not that the claimed antibodies eire useful as diagnostic tools for cancer, particularly 
stomach and lung tumors. 
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a. Appellants have established that the sene encodins the PRO 13 57 
polypeptide is differentially expressed in certain cancers 

As discussed above, the Examiner has not provided evidence or reasoning to challenge 
the reliability and significance of the data in Example 18 which reports that the mRNA for 
PRO 13 57 is more highly expressed in normal stomach and lung tissue compared to stomach and 
lung tumor, respectively. In contrast. Appellants' specification discloses gene expression data in 
Example 18 showing that the mRNA associated with protein PRO 1357 was more highly 
expressed in normal stomach and lung tissue compared to stomach and lung tumor, respectively. 
See Specification at \ [0530] and accompanying tables. Gene expression was analyzed using 
standard quantitative PGR amplification reactions of cDNA libraries isolated from different 
human tumor and normal human tissue samples. Id, It is well known in the art that the number 
of copies of a particular cDNA in the cDNA library is determined by the number of copies of the 
corresponding mRNA in the sample. Therefore, the cDNA libraries can be used to determine the 
level of expression of the corresponding mRNA in the tissue. 

Appellants have asserted that identification of the differential expression of the PRO 1357 
polypeptide-encoding gene in tumor tissue compared to the corresponding normal tissue renders 
the molecule useful as a diagnostic tool for the determination of the presence or absence of tumor. 
Id, In support of this asserted utility. Appellants submitted as Exhibit 1 to their Amendment and 
Response to Office Action mailed September 22, 2004, a first Declaration of J. Ghristopher 
Grimaldi, an expert in the field of cancer biology. This declaration explains the importance of 
the data in Example 18, and how differential gene and protein expression studies are used to 
differentiate between normal and tumor tissue. See First Grimaldi Declaration, 

In paragraphs 6 and 7, Mr. Grimaldi explains that the semi-quantitative analysis 
employed to generate the data of Example 1 8 is sufficient to determine if a gene is over- or 
under-expressed in tumor cells compared to corresponding normal tissue. He states that any 
visually detectable difference seen between two samples is indicative of at least a two-fold 
difference in cDNA between the tumor tissue and the counterpart normal tissue. He also states 
that the results of the gene expression studies indicate that the genes of interest "can be used to 
differentiate tumor from normal." He explains that, contrary to the PTO's assertions, "[t]he 
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precise levels of gene expression are irrelevant; what matters is that there is a relative difference 
in expression between normal tissue and tumor tissue." First Grimaldi Declaration at T| 7. 

This declaration makes clear that since it is the relative level of expression between 
normal tissue and suspected cancerous tissue that is important, how high the level of expression 
in normal tissue is, is irrelevant. As to the experimental conditions and the significance of the 
results, Appellants employed standard techniques which are well-known and accepted by those 
of skill in the art, and Mr. Grimaldi states that the samples are pooled samples of normal and 
tumor tissue, and therefore are more reliable than individual samples. Id at T[ 5. Mr. Grimaldi 
states that if a difference is detected using these techniques, "this indicates that the gene and its 
corresponding polypeptide and antibodies against the polypeptide are useful for diagnostic 
purposes..." Id. at 7. Thus, it is the uncontested opinion of an expert in the field that the results 
are reliable enough to indicate that the claimed antibodies are useful as diagnostic tools. 

The data in Example 18 and the first Grimaldi Declaration provide strong support for 
Appellants' asserted utility. Mr. Grimaldi is an expert in the field who conducted or supervised 
the experiments at issue. His declaration is based on personal knowledge of the relevant facts at 
issue. Appellants' have reminded the Examiner that "Office personnel must accept an opinion 
from a qualified expert that is based upon relevant facts whose accuracy is not being 
questioned." M.P.E.P. § 2107 (emphasis added). In addition, declarations relating to issues of 
fact should not be summarily dismissed as "opinions" without an adequate explanation of how 
the declaration fails to rebut the Examiner's position. See in re Alton 76 F.3d 1168 (Fed. Cir. 
1996). The Examiner has offered no reason or evidence to reject either the underlying data or 
Mr. Grimaldi's conclusions. Therefore, the Examiner should accept Mr. Grimaldi's opinion with 
regard to his statement that "any visually detectable difference seen between two samples is 
indicative of at least a two-fold difference in cDNA between the tumor tissue and the counterpart 
normal tissue" and that the genes of interest "can be used to differentiate tumor from normal." 

Given the disclosure of Example 1 8 and the supporting first Grimaldi declaration on the 
one hand, and the lack of any evidence on the other, it is clear that considering the evidence as a 
whole, one of skill in the art would conclude that it is more likely than not that the PRO 1357 
gene is differentially expressed in lung and stomach tumor tissue compared to their normal tissue 
counterparts such that is usefiil as a diagnostic tool to distinguish tumor tissue from normal tissue. 
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As Appellants explain below, it is more likely than not that the PR01357 polypeptide is 
also differentially expressed in lung and stomach tumor tissue, and can therefore be used to 
distinguish tumor tissue from normal tissue. This provides utility for the claimed antibodies. 

b. Appellants have established that senerallv there is a correlation between 
chanses in mRNA expression levels and chan2es in the expression level 
of the encoded polypeptide 

Appellants next turn to the second portion of their argument in support of their asserted 
utility - that it is well-established in the art that in most cases a change in the level of mRNA for 
a particular polypeptide leads to a corresponding change in the level of the encoded polypeptide. 
Given Appellants' evidence of differential expression of the mRNA for the PR01357 
polypeptide in lung and stomach tumor, it is more likely than not that the PRO 1357 polypeptide 
is likewise differentially expressed, and therefore the claimed antibodies are useful as diagnostic 
tools, particularly for lung and stomach txmior. 

In support of the assertion that changes in mRNA are positively correlated to changes in 
protein levels, Appellants submitted a second Declaration by J. Christopher Grimaldi, an expert 
in the field of cancer biology (originally submitted as Exhibit 2 with the Appellants' Amendment 
and Response to Office Action mailed September 22, 2004). As stated in paragraph 5 of the 
declaration, "Those who work in this field are well aware that in the vast majority of cases, when 
a gene is over-expressed... the gene product or polypeptide will also be over-expressed.... This 
same principal applies to gene under-expression." Second Grimaldi Declaration at 5. Further, 
"increased mRNA expression is expected to result in increased polypeptide expression, and the 
detection of decreased mRNA expression is expected to result in decreased polypeptide 
expression." Id, 

Appellants also submitted the declaration of Paul Polakis, Ph.D. an expert in the field of 

cancer biology (attached as Exhibit 3 to Appellants' Amendment and Response to Office Action 

mailed September 22, 2004). As stated in paragraph 6 of his declaration: 

Based on my own experience accumulated in more than 20 years of research, 
including the data discussed in paragraphs 4 and 5 above [showing a positive 
correlation between mRNA levels and encoded protein levels in the vast majority 
of cases studied in relation to the present invention] and my knowledge of the 
relevant scientific literature, it is my considered scientific opinion that for human 
genes, an increased level of mRNA in a tumor cell relative to a normal cell 
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typically correlates to a similar increase in abundance of the encoded protein in 
the tumor cell relative to the normal cell. In fact, it remains a central dogma in 
molecular biology that increased mRNA levels are predictive of corresponding 
increased levels of the encoded protein. Polakis Declaration at TI 6 (emphasis 
added). 

Dr. Polakis acknowledges that there are published cases where such a correlation does not exist, 
but states that it is his opinion, based on over 20 years of scientific research, that ''such reports 
are exceptions to the commonly understood general rule that increased mRNA levels are 
predictive of corresponding increased levels of the encoded protein." Polakis Declaration at ^1 6. 

The statements of Grimaldi and Polakis are supported by the teachings in Molecular 
Biology of the Cell, a leading textbook in the field (Alberts, et al. Molecular Biology of the Cell 
(3^^ ed. 1994) (submitted with Appellants' Amendment and Response to Final Office Action and 
Request for Continued Examination mailed April 1, 2005 as Exhibit 1, hereinafter "Cell 3 ") and 
(4^*^ ed. 2002) (submitted with Appellants' Amendment and Response to Final Office Action 
mailed February 10, 2005 as Exhibit 1, hereinafter "Cell 4'^")). Figure 9-2 of Cell 3'^ shows the 
steps at which eukaryotic gene expression can be controlled. The first step depicted is 
transcriptional control. Cell 3 provides that "[fjor most genes transcriptional controls are 
paramount. This makes sense because, of all the possible control points illustrated in Figure 9-2, 
only transcriptional control ensures that no superfluous intermediates are synthesized." Cell 3''^ 
at 403 (emphasis added). In addition, the text states that "Although controls on the initiation of 
gene transcription are the predominant form of regulation for most genes, other controls can act 
later in the pathway from RNA to protein to modulate the amount of gene product that is made." 
Cell 3 at 453 (emphasis added). Thus, as established in Cell 3 , the predominant mechanism 
for regulating the amount of protein produced is by regulating transcription. 

In Cell 4^^, Figure 6-3 on page 302 illustrates the basic principle that there is a correlation 
between increased gene expression and increased protein expression. The accompanying text 
states that "a cell can change (or regulate) the expression of each of its genes according to the 
needs of the moment - most obviously by controlling the production of its mRNA" Cell 4^^ at 
302 (emphasis added). Similarly, Figure 6-90 on page 364 of Cell 4^*^ illustrates the path from 
gene to protein. The accompanying text states that while potentially each step can be regulated 
by the cell, " the initiation of transcription is the most common point for a cell to regulate the 
expression of each of its genes ." Cell 4^^ at 364 (emphasis added). This point is repeated on 
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page 379, where the authors state that of all the possible points for regulating protein expression, 
" [f|or most genes transcriptional controls are paramount ." Cell at 379 (emphasis added). 

Further support for Appellants' position can be found in the textbook, Genes VI, 
(Benjamin Lewin, Genes VI (1997)) (submitted with Appellants' Amendment and Response to 
Final Office Action mailed February 10, 2005 as Exhibit 2) which states "having acknowledged 
that control of gene expression can occur at multiple stages, and that production of RN A cannot 
inevitably be equated with production of protein, it is clear that the overwhelming majority of 
regulatory events occur at the initiation of transcription ." Genes VI at 847-848 (emphasis added). 

Additional support is also found in Zhigang et al.. World Journal of Surgical Oncology 
2:13, 2004 (submitted with Appellants' Amendment and Response to Final Office Action mailed 
February 10, 2005 as Exhibit 3). Zhigang studied the expression of prostate stem cell antigen 
(PSCA) protein and mRNA to validate it as a potential molecular target for diagnosis and 
treatment of human prostate cancer. The data showed "a high degree of correlation between 
PSCA protein and mRNA expression" Zhigang at 4. Of the samples tested, 81 out of 87 showed 
a high degree of correlation between mRNA expression and protein expression. The authors 
conclude that "it is demonstrated that PSCA protein and mRNA overexpressed in human prostate 
cancer, and that the increased protein level of PSCA was resulted firom the upregulated 
transcription of its mRNA." Id at 6. Even though the correlation between mRNA expression 
and protein expression occurred in 93% of the samples tested, not 100%, the authors state that 
"PSCA may be a promising molecular marker for the clinical prognosis of human Pea and a 
valuable target for diagnosis and therapy of this tumor." Id, at 7. 

Further, Meric et aL, Molecular Cancer Therapeutics, vol. 1, 971-979 (2002), (submitted 

with Appellants' Amendment and Response to Final Office Action mailed February 10, 2005 as 

Exhibit 4), states the following: 

The fundamental principle of molecular therapeutics in cancer is to exploit the 
differences in gene expression between cancer cells and normal cells... [M]ost 
efforts have concentrated on identifying differences in gene expression at the 
level of mRNA, which can be attributable to either DNA amplification or to 
differences in transcription. Meric et al at 971 (emphasis added). 

Exploiting differences in gene expression between cancer cells and normal cells would not be a 
"fundamental principle" of molecular cancer therapeutics if there were no significant correlation 
between gene expression and protein levels. Stated another way, changes in mRNA without 
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corresponding changes in protein levels would have little or no effect on cellular biology, and 
those of skill in the art would have no reason to examine the differences in gene expression at the 
mRNA level without such a correlation. However, as one of skill in the art recognizes, there is a 
strong correlation between changes in mRNA and changes in protein level. It is because of this 
strong correlation that it remains a "fundamental principle" of molecular therapeutics in cancer 
to look at changes in mRNA level. 

Together, the declarations of Grimaldi and Polakis, the accompanying references, and the 
excerpts and references discussed above all establish that the accepted understanding in the art is 
that there is a reasonable correlation between changes in gene expression and changes in the 
level of the encoded protein. In contrast to this substantial amount of evidence supporting 
Appellants' position, the Examiner has cited two references, Haynes et al and Konopka et al\ 
moreover, the Examiner agrees that Konopka et al supports Appellants assertions. Thus, the 
Examiner provides only a single reference, Haynes, as countervailing Appellants' evidence. As 
discussed above, Haynes is not relevant to the issue of whether a change in mRNA levels leads 
to a change in the level of the corresponding protein. Therefore, it is clear that when considered 
as a whole, the preponderance of the evidence clearly weighs in favor of Appellants. 

Appellants have presented sufficient evidence to establish that the mRNA for PRO 1357 
is differentially expressed in stomach and lung tumors compared to their normal tissue 
counterparts, and that it is more likely than not that this leads to differential expression of the 
PR01357 polypeptide. This makes the claimed antibodies useful for diagnosing cancer, 
particularly stomach and lung tumors. Given the overwhelming amount of evidence in support 
of Appellants' position, and the near absence of any evidence in support of the Examiner's 
position, when considered as a whole, the evidence leads a person of ordinary skill in the art to 
conclude that the asserted utility is more likely than not true. 

c. The asserted utility is specific 

Finally, Appellants address the Examiner's argument that the asserted utilities are not 
specific to the claimed antibodies to PR01357. 

Specific Utility is defined as utility which is "specific to the subject matter claimed," in 
contrast to "a general utility that would be applicable to the broad class of the invention." 
M.P.KP. § 2107.01 I. Appellants submit that the evidence of differential expression of the 
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PRO 1357 gene and polypeptide in certain types of tumor cells, along with the declarations and 
references discussed above, provide a specific utility for the claimed antibodies. 

As discussed above, there are significant data which show that the gene for the PRO 1357 
polypeptide is expressed at least two-fold higher in normal lung and stomach tissue compared to 
lung and stomach tumor, respectively. These data are strong evidence that the PRO 1357 gene 
and polypeptide are associated with lung and stomach tumors. Thus, contrary to the assertions of 
the Examiner, Appellants have provided evidence associating the PRO 1357 gene and 
polypeptide with specific diseases. The asserted utility for the claimed antibodies as diagnostic 
tools for cancer, particularly lung and stomach tumor, is a specific utility - it is not a general 
utility that would apply to the broad class of antibodies. 

8. The Examiner Response to Appellants^ Evidence is Insufficient to Rebut 
Appellants * Arguments 

The Examiner has concluded that the teachings of the specification, the declarations of 
Grimaldi and Polakis and accompanying references, the excerpts and references, and supporting 

■ft 

arguments provided by Appellants are not persuasive. 

a. The Examiner^s response to the First Grimaldi Declaration 

The Examiner has dismissed the first Grimaldi Declaration because the disclosure in the 

specification is insufficient to overcome the utility rejection. The Examiner states: 

The conclusory statement of Grimaldi of the necessary existence of an at least 
two-fold differentiation in nucleic acid expression does not support a utility for or 
enable the invention because it does not fill important gaps in the disclosure 
needed to use the invention without significant further experimentation . . . While 
a 'relative difference in expression between normal tissue and suspected 
cancerous tissue' can be informative, without more specifics about necessary 
sample size, expression level range for normal and tumor tissues, types of 
stomach or lung tissue that can [be] used, and other questions, the specification 
has not provided the invention in an enabling form. Office Action dated June 22, 
2005, at pages 6-7. 

Appellants submit that, as discussed above, the burden is not on Appellants to provide 

"specifics about necessary sample size, expression level range for normal and tumor tissues, 

types of stomach or lung tissue that can [be] used" {Id. at page 7) in order to initially establish 

utility. Instead, the burden is on the Examiner to provide "evidence showing that one of ordinary 

-32- 



Appl, No. : 10/063,587 

Filed : May 3, 2002 



skill in the art would reasonably doubt the asserted utility." In re Brana, 51 F.3d 1560, 1566, 34 
U.S.P.Q.2d 1436 (Fed. Cir. 1995). 

The Examiner's rejection relies on placing the burden on Appellants to provide sufficient 
"specifics" in order to establish utility. As a result, the Examiner dismisses the Grimaldi 
reference "because it does not fill important gaps in the disclosure needed to use the invention." 
The Examiner does not rely on any evidence or reasoning to demonstrate why the specification 
must provide these enumerated experimental specifics, or to dismiss the Grimaldi declaration as 

i 

not filling "important gaps in the disclosure." Instead, the Examiner erroneously places the 
burden on Appellants' specification to initially establish the utility of the claimed antibodies, and 
disregards any statements in the Grimaldi declaration as not providing the information required 
of the specification. As discussed above, such a basis for a utility rejection is inconsistent with 
PTO policy and the guidelines provided by the courts. Moreover, statements of a Declaration are 
not to be dismissed without evaluation of the statements provided. "Office personnel must 
accept an opinion from a qualified expert that is based upon relevant facts whose accuracy is not 
being questioned." M.P.E.P, § 2107 (emphasis added). In addition, declarations relating to 
issues of fact should not be summarily dismissed as "opinions" without an adequate explanation 
of how the declaration fails to rebut the Examiner's position. See in re Alton 76 F.3d 1 168 (Fed. 
Cir. 1996). The Examiner has offered no reason or evidence to reject either the underlying data 
or Mr. Grimaldi's conclusions. Therefore, the Examiner should accept Mr. Grimaldi's opinion 
with regard to his statement that "any visually detectable difference seen between two samples is 
indicative of at least a two-fold difference in cDN A between the tumor tissue and the counterpart 
normal tissue" and that the genes of interest "can be used to differentiate tumor from normal." 

Furthermore, the Examiner's criticisms of the specification and the Grimaldi declaration 
are based on assertions that the specification provides insufficient disclosure of differential 
nucleic acid expression. The Examiner has accepted Appellants differential expression evidence 
as supporting a utility for nucleic acids encoding the PR01357 polypeptide. Thus, the 
Examiner's basis for indicating failings of the specification and for dismissing the Grimaldi 
declaration are inconsonant with the Examiner's acceptance of the utility of nucleic acids 
encoding the PR01357 polypeptide. Accordingly, the Examiner's position is not even supported 
by the Examiner's own reasoning. 

Also regarding the Grimaldi declaration, the Examiner states: 
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The declaration also says (f 5) that 'Data from a pooled sample are more likely to 
be accurate that [sic] data from a single individual.' This bags [sic] the question 
of whether the tissue from an individual could be assessed for whether or not it is 
cancerous. Clinical diagnostics are not usually geared toward a populous but 
toward an individual's particular condition. Office Action dated June 22, 2005, at 
page 7. 

Thus, the Examiner is alleging that results from pooled samples have questionable 
relevance in clinical diagnostics because evidence of differential expression in a populous is not 
directed toward diagnosing an individual's particular condition. If it were true that a disease 
indicator, such as differential expression, from a population were not relevant to diagnosing 
disease in an individual, the concept of diagnosis would not exist. Appellants submit that 
diagnosis of an individual's particular disease condition is based on disease indicators derived 
from characteristics of a populous. The Examiner's rationale for criticizing the results from 
pooled samples would render any population-based disease-indicative information irrelevant. 
Appellants submit that disease-indicative information, such as differential expression, from a 
generalized population does provide a useftil tool for diagnosing an individual particular 
condition because the information is "likely to represent a more generally relevant condition." 
First Grimaldi Declaration^ at T| 5. Accordingly, insofar as the Examiner holds that the data of 
the specification and supporting evidence of the Grimaldi declaration are insufficient because 
they are not useful for diagnostic purposes, Appellants submit that the Examiner's basis for these 
holdings is unfounded. 



b. The Examiner response to the Polakis Declaration and the Second 
Grimaldi Declaration 
The Examiner provides three reasons for dismissing the Polakis Declaration: 

First, it is important to note that the insteint specification provides no information 
regarding decreased mRNA levels of PRO 1357 in tumor samples relative to 
normal samples. Only relative gene expression data was presented. Second, the 
declaration does not provide data such that the Examiner can independently draw 
conclusions. Only Dr. Polakis' conclusions are provided in the declaration. 
There is no evidentiary support to Dr. Polakis' statement that it remains central 
dogma in molecular biology that increased mRNA levels are predictive of 
corresponding increased levels of the encoded polypeptide. Finally, it is noted 
that the literature cautions researchers from drawing conclusions based on small 
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changes in transcript expression levels between normal and cancerous tissue 
[citing Hu et al.]. Office Action dated December 13, 2004, at page 5. 

No subsequent Action from the Examiner provided any further reason for dismissing the 
Polakis Declaration. In contrast to the above, the most recent Office Action, dated June 22, 2005, 
has stated that "it is agreed that the polynucleotide of SEQ ID NO:77 has this specific utility" 
and the "encoding nucleic acid of SEQ ID NO:77 appears to be underexpressed in stomach and 
lung tumors." Office Action dated June 22, 2005, at pages 10 and 6. Thus, the first of the 
Examiner's reasons, "the instant specification provides no information regarding decreased 
mRNA levels of PR01357 in tumor samples relative to normal samples," is no longer consistent 
with the Examiner's present position. Further, the third of the Examiner's reasons, "the literature 
cautions researchers from drawing conclusions based on small changes in transcript expression 
levels between normal and cancerous tissue" also is inconsistent with the Examiner's recognition 
of utility of nucleic acids encoding the PRO 1357 polypeptide based on the changes in transcript 
expression levels between normal and cancerous tissue disclosed in Example 18. Further, the 
Examiner's third reason is based on Hu et al, which, as Appellants have discussed above, is not 
relevant to Appellants' asserted utility. 

Thus, the only reason previously provided by the Examiner for dismissing the Polakis 
declaration which is not inconsistent with the Examiner's current position is that "the declaration 
does not provide data such that the Examiner can independently draw conclusions." Appellants 
again emphasize that statements of a Declaration are not to be dismissed without evaluation of 
the statements provided. "Office personnel must accept an opinion from a qualified expert that is 
based upon relevant facts whose accuracy is not being questioned." M.P.E.P, § 2107 (emphasis 
added). In addition, declarations relating to issues of fact should not be summarily dismissed as 
"opinions" without an adequate explanation of how the declaration fails to rebut the Examiner's 
position. See in re Alton 76 F.3d 1168 (Fed. Cir. 1996). The Examiner's dismissal of the 
Polakis declaration is not based on any reason or evidence other than a demand for "data such 
that the Examiner can independently draw conclusions," which is an impermissible basis for 
dismissing a declaration. Accordingly, the Examiner does not provide sufficient basis for 
dismissing the Polakis declaration. Furthermore, the Examiner does not provide any reasoning 
questioning any of the statements within the Polakis declaration. As such, the statements of the 
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Polakis declaration have not been validly challenged by the Examiner, and, therefore, there is 
nothing in the record to controvert the Polakis declaration. 

Regarding the second Grimaldi declaration, the Examiner points only to statements 
discussing rare cases where protein expression does not correlate with mRNA expression. The 
Examiner is completely silent regarding the statement that "[TJhose who work in this field are 
well aware that in the vast majority of cases, when a gene is over-expressed... the gene product 
or polypeptide will also be over-expressed.... This same principal applies to gene under- 
expression." Second Grimaldi Declaration at T| 5. No indication is provided by the Examiner 
whether or not the Examiner agrees with this statement. Moreover, no reasoning or evidence 
whatsoever has been offered by the Examiner to question these statements in the second 
Grimaldi declaration. As such, the above statements of the second Grimaldi declaration have not 
been challenged by the Examiner, and, therefore, there is nothing in the record to controvert 
these statements of the second Grimaldi declaration. 

c. The Examiner's response to the Various Publications Submitted 

Appellants have submitted, inter alia, textbook excerpts and research publications in 
further support of the position that changes in mRNA levels are typically accompanied by 
changes in levels of the encoded polypeptide. In response, the Examiner has stated, regarding all 
such references: 

The argument has been fully considered, but is not persuasive. As discussed 
above, there is sound supporting evidence showing the unpredictability of saying 
level of expression of a particular nucleic acid will correlate with expression of 
the encoded protein. Office Action dated June 22, 2005, at page 9. 

As discussed above, of the evidence presented by the Examiner, only the Haynes and 
Konopka references discuss mRNA levels in relation to polypeptide levels, and Konopka 
squarely supports Appellants' assertions. Thus the Examiner's "sound supporting evidence" is a 
single reference, Haynes. The Examiner uses the teachings of this single reference to dismiss 
numerous teachings in textbooks and in research articles in support of the assertion that changes 
in mRNA levels are typically accompanied by changes in levels of the encoded polypeptide. 

Appellants submit that even if Haynes supported the Examiner's position, which it does 
not, dismissal of textbooks and research articles, including dismissal of the Examiner's own 
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evidence, in view of a single research article, does not represent full consideration of the totality 
of the record in fairly evaluating the utility of the claimed antibodies. 
As provided in the M.P.E.P.: 

It is essential for Office personnel to recognize, fully consider and respond to each 
substantive element of any response to a rejection based on lack of utility. Only 
where the totality of the record continues to show that the asserted utility is not 
specific, substantial, and credible should a rejection based on lack of utility be 
maintained. M.P,KP, §2107 

Appellants have submitted Declarations, textbook excerpts and research articles in 
support of Appellants assertion that one skilled in the art would have believed that changes in 
mRNA levels are typically accompanied by changes in levels of the encoded polypeptide. 
Further, the Examiner has presented Konopka, which supports Appellants' position. To contest 
Appellants' assertions, the Examiner presented Haynes, which the Examiner alleges teaches the 
unpredictability of polypeptide levels compared to mRNA levels. Thus in fully considering the 
totality of the record, the Examiner places more weight on the single reference by Haynes than 
on all of Appellants' evidence and the Konopka reference cited by the Examiner herself. 
Appellants submit that, contrary to the Examiner's holding, the totality of the record shows that 
the asserted utility is specific, substantial, and credible, and the Examiner's holding otherwise 
does not represent a fair and full consideration of the totality of the record, as required. 

9. The Courts have held that the Utility Requirement was Satisfied in Similar 
Cases 

The seminal decision interpreting the utility requirement of 35 U.S.C. § 101 is Brenner v. 
Manson, 383 U.S. 519, 148 U.S.P.Q. 689 (1966). At issue in Brenner was a claim to "a 
chemical process which yields an already known product whose utility - other than as a possible 
object of scientific inquiry - ha[d] not yet been evidenced." Id at 529, 148 U.S.P.Q. at 693. The 
Patent Office rejected the claimed process for lack of utility because the product produced by the 
claimed process had no known use. See id. at 521-22, 148 U.S.P.Q. at 690. On appeal, the Court 
of Customs and Patent Appeals reversed, holding "where a claimed process produces a known 
product it is not necessary to show utility for the product." Id, at 522, 148 U.S.P.Q. at 691. 

In reviewing the lower court's decision, the Court made its oft quoted statement that 
"[t]he basic quid pro quo contemplated by the Constitution and the Congress for granting a 
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patent monopoly is the benefit derived by the public from an invention with substantial utility. 
Unless and until a process is refined and developed to this point - where specific benefit exists in 
currently available form - there is insufficient justification for permitting an Appellant to engross 
what may prove to be a broad field." Id, at 534-35, 148 U.S.P.Q. at 695. 

The first opinion of the C.C.P.A. applying Brenner was In re Kirk, 376 F.2d 936, 153 
U.S.P.Q. 48 (C.C.P.A. 1967). The invention claimed in Kirk was a set of steroid derivatives said 
to have valuable biological properties and to be of value "in the furtherance of steroidal research 
and in the application of steroidal materials to veterinary or medical practice." Id. at 938, 153 
U.S.P.Q. at 50. In affirming the claim rejection based on a lack of utility, the court held that the 
"nebulous expressions 'biological activity' or 'biological properties'" did not adequately convey 
how to use the claimed compounds." Id, at 941, 153 U.S.P.Q. at 52. The court also rejected 
Appellants' supporting affidavit, stating, "the sum and substance of the affidavit appears to be 
that one of ordinary skill in the art would know 'how to use' the compounds to find out in the 
first instance whether the compounds are - or are not - in fact useftil or possess useful properties, 
and to ascertain what those properties are." Id, at 942, 153 U.S.P.Q. at 53. 

Since these early decisions, the courts have continued to clarify what is sufficient to 
satisfy the utility requirement. Three more recent decisions are of particular relevance to the 
instant application: Nelson v. Bowler, 626 F.2d 853, 206 U.S.P.Q. 881 (C.C.P.A. 1980), Cross v. 
lizuka, 753 F.2d 1040, 224 U.S.P.Q. 739 (Fed. Cir. 1985), and Fujikawa v. Wattanasin, 93 F.3d 
1559, 39 U.S.P.Q. 2d 1895 (Fed. Cir. 1996). 

The earliest of these cases, Nelson v. Bowler, involved an interference between two 
applications related to derivatives of naturally occurring prostaglandins (PG). Nelson, 626 F.2d 
at 854-55. The issue was whether Nelson had shown at least one utility for the compounds at 
issue to establish an actual reduction to practice. Id, at 855. The Appellants relied on two tests 
to prove practical utility: an in vivo rat blood pressure (BP) test and an in vitro gerbil colon 
smooth muscle stimulation (GC-SMS) test. In the BP test, the blood pressure of anesthetized 
rats was recorded on a polygraph chart to determine whether an injected compound had any 
effect. Responses were categorized as either a depressor (lowering) effect or a pressor 
(elevating) effect. Id, In the GC-SMS test a section of colon was excised from a freshly-killed 
gerbil for suspension in a physiological solution, and a lever arm was connected to the colon in 
such a way that any contraction was recorded as a polygraph trace. Id. The Board held that 
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Nelson had not shown adequate proof of practical utility, characterizing the tests as "rough 

screens, uncorrelated with actual utility." Id, at 856. 

On appeal the C.C.P.A. reversed, holding that the Board "erred in not recognizing that 

tests evidencing pharmacological activity may manifest a practical utility even though they may 

not establish a specific therapeutic use." Id. The Court stated that "practical utility" was 

characterized as a use of the claimed discovery in a manner which provides some immediate 

benefit to the public, establishing the following rule: 

Knowledge of the pharmacological activity of any compound is obviously 
beneficial to the public. It is inherently faster and easier to combat illnesses and 
alleviate symptoms when the medical profession is armed with an arsenal of 
chemicals having known pharmacological activities. Since it is crucial to provide 
researchers with an incentive to disclose pharmacological activities in as many 
compounds as possible, we conclude that adequate proof of any such activity 
constitutes a showing of practical utility. Id, (emphasis added). 

The Court rejected Bowler's argument that the BP and GC-SMS tests are inconclusive 
showings of pharmacological activity since confirmation by statistically significant means did 
not occur until after the critical date. The Court stated that "a rigorous correlation is not 
necessary where the test for pharmacological activity is reasonably indicative of the desired 
response." Id. (emphasis added). The Court concluded that a " reasonable correlation " between 
the observed properties and the suggested use was sufficient to establish practical utility. Id. at 
857. 

The sufficiency of a "reasonable correlation" in establishing utility was affirmed by the 
Court of Appeals for the Federal Circuit in Cross v. lizuka, 753 F.2d 1040, 224 U.S.P.Q. 739 
(Fed. Cir. 1985). In Cross, the subject of the interference before the Court was imidazole 
derivative compounds which inhibit the synthesis of thromboxane synthetase, an enzyme which 
leads to the formation of thromboxane A2. At the time the applications were filed, 
thromboxane A2 was postulated to be involved in platelet aggregation, which was associated 
with several deleterious conditions. Id. at 1042. 

The question before the Board and reviewed by the Court was whether lizuka was 
entitled to the benefit of his Japanese priority application. Id. The Japanese application 
disclosed that the imidazole derivatives showed strong inhibitory action for thromboxane 
synthetase from human or bovine platelet microsomes, an in vitro utility. Id. at 1043. Relying in 
part on Nelson, the Board held that tests evidencing pharmacological activity may manifest a 
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practical utility even though they may. not establish a specific therapeutic use, and concluded that 
the in vitro tests were sufficient to establish a practical utility. Id 

On appeal, Cross argued that the basic in vitro tests conducted in cellular fractions did not 
establish a practical utility for the claimed compounds, and that more sophisticated in vitro or in 
vivo tests were necessary to estabhsh a practical utility. Id at 1050. The Court rejected this 
argument, noting that adequate proof of any pharmaceutical activity constitutes a showing of 
practical utility. Id. The Court accepted the argument that initial testing of compounds is widely 
done in vitro: 

[I\n vitro results... are generally predictive of in vivo test results, i.e., there is a 
reasonable correlation therebetween. Were this not so, the testing procedures of 
the pharmaceutical industry would not be as they are. lizuka has not urged, and 
rightly so, that there is an invariable exact correlation between in vitro test results 
and in vivo test results. Rather, lizuka' s position is that successful in vitro testing 
for a particular pharmacological activity establishes a significant probability that 
in vivo testing for this particular pharmacological activity will be successful. Id, 
(emphasis added). 

The Court also noted that in previous decisions, its predecessor court had accepted 

evidence of in vivo utility as sufficient to establish practical utility. The Court reasoned that: 

This in vivo testing is but an intermediate link in a screening chain which may 
eventually lead to the use of the drug as a therapeutic agent in humans. We 
perceive no insurmountable difficulty, under appropriate circumstances, in finding 
that the first link in the screening chain, in vitro testing, may establish a practical 
utility for the compound in question. Successful in vitro testing will marshal 
resources and direct the expenditure of effort to further in vivo testing of the most 
potent compounds, thereby providing an immediate benefit to the public, 
analogous to the benefit provided by the showing of an in vivo utility . Id. at 1051, 
citing Nelson, 626 F.2d at 856 (emphasis added). 

Based on this reasoning, the Court affirmed the decision of the Board, stating that "based 
upon the relevant evidence as a whole, there is a reasonable correlation between the disclosed in 
vitro utility and an in vivo activity, and therefore a rigorous correlation is not necessary where 
the disclosure of pharmacological activity is reasonable based upon the probative evidence." Id. 
at 1050 (emphasis added). The Court therefore held that the disclosed in vitro utility was 
"sufficient to comply with the practical utility requirement of § 101." Id. at 1051. 

The holdings of Nelson and Cross were more recently affirmed in Fujikawa v. 

Wattanasin, 93 F.3d 1559, 39 U.S.P.Q.2d 1895 (Fed. Cir. 1996). In Fujikawa, the Court again 

affirmed the notion that initial screens of compounds provide a practical utility even though they 
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may not provide a therapeutic use because '"[i]t is inherently faster and easier to combat 

illnesses and alleviate symptoms when the medical profession is armed with an arsenal of 

chemicals having known pharmacological activities.'" Id, at 1564, quoting Nelson, 626 F.2d at 

856. The Court noted that it may be difficult to predict whether novel compounds will exhibit 

pharmacological activity, and consequently testing is often required to establish practical utility. 

Id However the Court went on to state: 

But the test results need not absolutely prove that the compound is 
pharmacologically active. All that is required is that the tests be ''reasonably 
indicative of the desired [pharmacological] response." In other words, there must 
be a sufficient correlation between the tests and an asserted pharmacological 
activity so as to convince those skilled in the art, to a reasonable probability , that 
the novel compound will exhibit the asserted pharmacological behavior." Id. 
(internal citations omitted, underline emphasis added, italics in original). 

On appeal, Fujikawa argued that Wattanasin had failed to establish an adequate 
correlation between the in vitro and in vivo results to permit Wattanasin to rely on positive in 
vitro results to establish a practical utility. The Court stated that the Board relied on testimony 
from those skilled in the art that the in vitro results convinced the experts that the claimed 
compounds would exhibit the desired pharmacological activity when administered in vivo, 
including testimony that in vivo activity is typically highly correlatable to a compound's in vitro 
activity in the field. Id, at 1565, To overcome this evidence and counter the Board's decision, 
Fujikawa pointed to the testimony of its expert that "there is a reasonable element of doubt that 
some elements may be encountered which are active in the in vitro assay, but yet inactive in the 
in vivo assay." Id. 

The Court rejected this argument: "Of course, it is possible that some compounds active 
in vitro may not be active in vivo. But, as our predecessor court in Nelson explained, a 'rigorous 
correlation' need not be shown in order to establish practical utility: 'reasonable correlation' 
suffices ." Id. (emphasis added). The Court also rejected Fujikawa's reliance on two articles. 
The Court noted that while one article taught that "m vitro testing is sometimes not a good 
indicator of how potent a compound will be in vivo, it does imply that compounds which are 
active in vitro will normally exhibit some in vivo activity." Id. at 1566. Similarly, the Court 

■ 

noted that the second article expressly stated that "[fjor most substances, although not for all, the 
relative potency determined in in vitro ... parallels the in vivo activity." Id. 
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The Court concluded that the facts in the case were analogous to the ones in Cross where 
the court relied on a known reasonable correlation between in vitro tests and in vivo activity, and 
therefore affirmed the Board's decision that Wattanasin had established a practical utility with 
the in vitro results. M at 1565-66. 

The Nelson, Cross, and Fujikawa cases are very similar to the present case. The 
reasoning of the courts in all three cases that "'[i]t is inherently faster and easier to combat 
illnesses and alleviate symptoms when the medical profession is armed with an arsenal of 
chemicals having known pharmacological activities'" applies to the asserted utility for the 
claimed antibodies. Fujikawa, 93 F.3d at 1564, quoting Nelson, 626 F.2d at 856; see also Cross, 
753 F.2d at 1051 ("Successful in vitro testing will marshal resources and direct the expenditure 
of effort to further in vivo testing of the most potent compounds, thereby providing an immediate 
benefit to the public, analogous to the benefit provided by the showing of an in vivo utility."). 
Like pharmaceutical compovmds, nucleic acids, polypeptides, and antibodies which are 
associated with cancer will make it inherently faster and easier to combat cancer. The greater the 
number of biological markers of cancer medical professionals have access to, the more accurate 
and detailed a diagnosis they can make. The determination that a gene is differentially expressed 
in cancer constitutes at least as significant a development in the field of cancer diagnostics as in 
vitro screening for pharmaceutical activity. See Cross, 753 F.2d at 1051 ("the first link in the 
screening chain, in vitro testing, may establish a practical utility for the compound in question. 
Successful in vitro testing will marshal resources and direct the expenditure of effort to further in 
vivo testing of the most potent compounds, thereby providing an immediate benefit to the 
public"). 

In addition, like in vitro tests in the pharmaceutical industry, those of skill in the field of 
biotechnology rely on the reasonable correlation that exists between gene expression and protein 
expression (see discussion supra). Were there no reasonable correlation between the two, the 
techniques that measure gene levels such as microarray analysis, differential display, and 
quantitative PGR would not be so widely used by those in the art. See Second Grimaldi 
Declaration at T[ 5. As in Cross, Appellants here do not argue that there is "an invariable exact 
correlation" between gene expression and protein expression. See Cross, 753 F.2d at 1050. 
Instead, Appellants' position detailed above is that a measured change in gene expression in 
cancer cells establishes a "significant probability" that the expression of the encoded polypeptide 
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in cancer will also be changed based on "a reasonable correlation therebetween," Id ; see also 
Fujikawa^ 93 F.3d at 1565 ("a 'rigorous correlation' need not be shown in order to establish 
practical utility; 'reasonable correlation' suffices"); Nelson, 626 F.2d at 857 (holding that "a 
rigorous correlation is not necessary" and that a "reasonable correlation" will suffice). 

Also of importance is the Court's rejection of the notion that any in vitro testing must be 
statistically significant to support a practical utility. Nelson^ 626 F.2d at 857. Likewise, 
qualitative characterizations of a test compound as either increasing or decreasing blood pressure 
was acceptable. Id at 855 (stating that responses were categorized as either a depressor 
(lowering) effect or a pressor (elevating) effect). This is similar to the data in Example 18, 
where the change in mRNA levels is described as "more highly expressed." 

The Examiner has stated that "[t]he instant situation is directly analogous to that which 
was addressed in Brenner v. Manson, 148 U.S.P.Q. 689 (Sus. [sic] Ct, 1966)." Office Action 
dated June 22, 2005, at pages 5-6. However, the Examiner points to no facts whatsoever in 
either Brenner v. Manson or the present case in support of the assertion that they are "directly 
analogous." Appellants submit that the present case is not similar to Brenner v. Manson. In 
Brenner v. Manson, the claimed subject matter at issue was a process for making a product with 
no known utility. In the present case, the claimed subject matter is directed to antibodies that are 
useful for cancer diagnostics, and Appellants' assertions of utility are supported by differential 
expression data of the encoding mRNA. Thus, Appellants submit that the above-discussed court 
cases which followed Brenner v. Manson are most related to the instant situation, and these cases 
have held in favor of a utility for the claimed subject matter. 

In conclusion, Appellants have asserted that the claimed antibodies are useful for the 
diagnosis of cancer, particularly stomach and lung cancer based on the data in Example 18. This 
utility is far beyond the nebulous expressions "biological activity" or "biological properties" 
rejected in In re Kirk, 376 F.2d 936, 153 U.S.P.Q. 48 (C.C.P.A. 1967). Like Nelson, Cross, and 
Fujikawa, Appellants have asserted a utility which relies on a reasonable correlation between the 
data disclosed in the application and the asserted utility. The fact that there may be limited 
evidence that the correlation is not exact does not invalidate Appellants' showing of utility since 
the correlation need not be a rigorous or exact one. Considering the relevant evidence as a whole, 
Appellants have provided sufficient evidence to establish a reasonable correlation between 
changes in the level of mRNA and corresponding changes in the level of the encoded 

-43- 



Appl. No, : 10/063,587 

Filed : May 3, 2002 



polypeptide. Therefore the claimed antibodies have a practical utility as diagnostic tools for 
stomach and lung cancer. 

10. Utility - Conclusion 

Appellants' asserted utility for the claimed antibodies as diagnostic tools for cancer 
corresponds in scope to the subject matter sought to be patented and therefore "must be taken as 
sufficient to satisfy the utility requirement of § 101 for the entire claimed subject." In re Langer, 
503 F.2d 1380, 1391, 183 U.S.P.Q. 288, 297 (C.C.P.A. 1974). The Examiner's unsupported 
arguments and references are not sufficient evidence to make a prima facie showing that "one of 
ordinary skill in the art would reasonably doubt the asserted utility." In re Brana, 51 F.3d 1560, 
1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995). 

And even if the Examiner has established a prima facie case, Appellants have offered 
sufficient rebuttal evidence in the form of expert declarations and references, which, when 
considered as a whole, establish that it is more likely than not that the asserted utility is true. See 
In re Oetiker, 977 F.2d 1443, 1445, 24 U.S,P.Q.2d 1443, 1444 (Fed. Cir. 1992) (stating that the 
evidentiary standard to be used throughout ex parte examination in setting forth a rejection is a 
preponderance of the evidence, or "more likely than not" standard); M,P.E.P, at § 2107.02, part 
VII ("evidence will be sufficient if, considered as a whole, it leads a person of ordinary skill in 
the art to conclude that the asserted utility is more likely than not true .") (emphasis in original). 

Finally, the courts' decisions in similar cases make clear that the evidence provided by 
Appellants is sufficient to establish the asserted utility. The evidence does not need to be direct 
evidence, nor does it need to provide an exact correlation between the submitted evidence and 
the asserted utility. Instead, evidence which is "reasonably" correlated with the asserted utility is 
sufficient. See Fujikawa, 93 F.3d at 1565 ("a 'rigorous correlation' need not be shown in order 
to establish practical utility; 'reasonable correlation' suffices"); Cross, 753 F.2d at 1050 (same); 
Nelson, 626 F.2d at 857 (same). Considering the evidence as a whole in light of the relevant 
cases, the Board should find that Appellants have established at least one specific, substantial, 
and credible utility, and the Examiner's rejection of the pending claims as lacking utility should 
be reversed. 
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C. Enablement Rejection 

The second issue before the Board is whether Appellants have enabled the pending 
claims such that one of skill in the art would be able to make and use the claimed invention. The 
Examiner has rejected Claims 1-5 under 35 U.S.C. § 1 12, first paragraph, asserting that because 
the claimed invention is not supported by either a specific or substantial asserted utility or a well- 
established utility, one skilled in the art would not know how to use the claimed invention. See 
Office Action dated June 22, 2005, at page 2. 

The Examiner recites the factors for determining enablement from In re Wands, 858 F.2d 
731, 8 U.S.P.Q. 2d 1400 (Fed. Cir. 1988), relying on the arguments made in support of the 
rejection for lack of utility. 

Appellants submit that Claims 1 -5 are enabled such that one of skill in the art could make 
and use the claimed antibodies without undue experimentation. Applicants submit that how to 
make an antibody that specifically binds to the polypeptide of SEQ ID NO: 78 was within the 
skill in the art. See In re Wands, 858 F.2d 731 (reversing the Board's decision of non- 
enablement and holding that as of 1980, undue experimentation was not required to make high- 
affinity monoclonal antibodies to a target peptide). Thus, one of skill in the art would be able to 
make the claimed antibodies without undue experimentation. The Examiner has not made any 
arguments to the contrary. 

As described above, Appellants assert that the claimed antibodies are useful as diagnostic 
tools for cancer, particularly lung and stomach cancer. This use is based in part on the disclosure 
in Example 18 of the instant application that the nucleic acid encoding the PRO 13 57 polypeptide 
is at least two-fold differentially expressed in lung and stomach tumor relative to normal lung 
and stomach, respectively. As detailed above, it is well-established that changes in expression 
levels of mRNA leads to corresponding changes in expression levels of the encoded polypeptide, 
and thus it is likely that the PRO 1357 polypeptide also is differentially expressed in lung and 
stomach cancer. Thus, based on the disclosure in the application, one of skill in the art would be 
able to use the claimed antibodies as diagnostic tools to distinguish suspected lung and stomach 
tumors from normal tissue without undue experimentation. 
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1. Enablement - Lesal Standard 

An application enables the claims "if one skilled in the art, after reading the[] disclosure[], 
could practice the invention claimed ... without undue experimentation." Chiron Corp. v. 
GenentecK Inc., 363 F.3d 1247, 1253 (Fed. Cir. 2004). "But the question of undue 
experimentation is a matter of degree. The fact that some experimentation is necessary does not 
preclude enablement; what is required is that the amount of experimentation 'must not be unduly 
extensive.'" PPG Indus., Inc. v. Guardian Indus., Corp., 75 F.3d 1558, 1564 (Fed. Cir. 1996) 
(quoting Atlas Powder Co. v. E.I DuPont de Nemours & Co., 750 F.2d 1569, 1576 (Fed. Cir. 
1984)). 

While the application must enable one skilled in the art to practice the full scope of the 
claimed invention, "[t]hat is not to say that the specification itself must necessarily describe how 
to make and use every possible variant of the claimed invention, for the artisan's knowledge of 
the prior art and routine experimentation can often fill gaps, interpolate between embodiments, 
and perhaps even extrapolate beyond the disclosed embodiments, depending upon the 

> 

predictability of the art." AK Steel Corp v. Sollac, 344 F.3d 1234, 1244 (Fed. Cir. 2003). 

"Enablement is not precluded by the necessity for some experimentation such as routine 
screening. However, experimentation needed to practice the invention must not be undue 
experimentation. The key word is 'undue,' not 'experimentation.'" In re Wands 858 F.2d 731, 
736-7, 8 U.S.P.Q.2d 1400, (Fed. Cir. 1988), citations omitted. 

It is equally clear that a rejection based on "lack of utility," whether grounded 
upon 35 U.S.C. 101 or 35 U.S.C. 112. first paragraph, rests on the same basis (i.e., 
the asserted utility is not credible) . To avoid confiision, any rejection that is 
imposed on the basis of 35 U.S.C. 101 should be accompanied by a rejection 
based on 35 U.S.C. 112, first paragraph. The 35 U.S.C. 112, first paragraph, 
rejection should be set out as a separate rejection that incorporates by reference 
the factual basis and conclusions set forth in the 35 U.S.C. 101 rejection. The 35 
U.S.C. 1 12, first paragraph, rejection should indicate that because the invention as 
claimed does not have utility, a person skilled in the art would not be able to use 
the invention as claimed, and as such, the claim is defective under 35 U.S.C. 1 12, 
first paragraph. A 35 U.S.C. 1 12, first paragraph, rejection should not be imposed 
or maintained unless an appropriate basis exists for imposing a rejection under 35 
U.S.C. 101. In other words. Office personnel should not impose a 35 U.S.C. 
112, first paragraph, rejection grounded on a ^4ack of utility^^ basis unless a 
35 U.S,C. 101 rejection is proper . In particular, the factual showing needed to 
impose a rejection under 35 U.S.C. 101 must be provided if a rejection under 35 
U.S.C. 1 12, first paragraph, is to be imposed on "lack of utility" grounds. 
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To avoid confusion during examination, any rejection under 35 U.S.C. 112, first 
paragraph, based on grounds other than "lack of utility" should be imposed 
separately from any rejection imposed due to "lack of utility" under 35 U.S.C. 
101 and 35 U.S.C. 1 12, first paragraph. M.P.KP, § 2107.01 IV (emphasis added). 

2. Enablement - Burden of Proof 

In order to make an enablement rejection, the PTO has the initial burden to establish a 
reasonable basis to question the enablement provided for the claimed invention. See M.P.E.P. 
§ 2164.04. A specification teaching how to make and use the claimed subject matter must be 
taken as being in compliance with the enablement requirement unless there is a reason to doubt 
the objective truth of the statements contained therein which are relied on for enabling support. 
Id, It is incumbent for the PTO "to explain why it doubts the truth or accuracy of any statement 
in a supporting disclosure and to back up assertions of its own with acceptable evidence or 
reasoning which is inconsistent with the contested statement." Id (quoting In re Marzocchi, 439 
F.2d 220, 224, 169 U.S.P.Q. 367, 370 (C.C.P.A. 1971). This can be done "by making specific 
findings of fact, supported by the evidence, and then drawing conclusions based on these 
findings of fact." Id. 

3. Enablement - Standard of Proof 

Once the examiner has weighed all the evidence and established a reasonable basis to 
question the enablement provided for the claimed invention, the burden falls on the applicant to 
present persuasive arguments, supported by suitable proofs where necessary, that one skilled in 
the art would be able to make and use the claimed invention using the application as a guide. See 
M.P.E.P. § 2164.05. "The evidence provided by applicant need not be conclusive but merely 
convincing to one skilled in the art." Id. (bold emphasis added, underline in original). "A 
declaration or affidavit is, itself, evidence that must be considered ." Id, (emphasis in original). 

The examiner must then "weigh all the evidence before him or her, including the 
specification and any new evidence supplied by applicant with evidence and/or sound scientific 
reasoning previously presented in the rejection and decide whether the claimed invention is 
enabled." Id. "The examiner should never make the determination based on personal opinion. 
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The determination should always be based on the weight of all the evidence." Id (emphasis in 
original). 

4. Appellants' Claimed Antibodies have Utility, and, therefore are Enabled 
Appellants submit that because the claimed antibodies have substantial, specific and 
credible utility, it is not proper to reject the claimed antibodies as lacking enablement on a "lack 
of utility" basis. In the Office Action mailed June 22, 2005, the Examiner's reasoning for 
rejecting the claims as lacking utility are closely interwoven with the Examiner's reasoning for 
rejecting the claims as lacking enablement. While Appellants acknowledge that claims can be 
rejected as drawn to subject matter having utility while nevertheless lacking enablement, in the 
instant case, the Examiner's reasoning and submitted evidence for holding that the claims lack 
enablement are the same as those for holding that the claims lack utility. Thus, by interweaving 
utility and enablement rejections, the Examiner demonstrates that the enablement rejection is 

« 

grounded on a "lack of utility" basis. 
Under the M.P.E.P.: 

■ 

A 35 U.S.C. 112, first paragraph, rejection should not be imposed or maintained 
unless an appropriate basis exists for imposing a rejection under 35 U.S.C. 101. In 
other words. Office personnel should not impose a 35 U.S.C. 1 12, first paragraph, 
rejection grounded on a ''lack of utility" basis unless a 35 U.S.C. 101 rejection is 
proper. MP.£.P. § 2107.01 IV (emphasis added). 

Appellants submit that because the Examiner's enablement rejection is grounded on a 
"lack of utility" basis, the present enablement rejection is only proper if the utility rejection is 
proper. Appellants have argued above that one skilled in the art would have believed the 
claimed antibodies have a substantial, specific and credible utility, and, thus, a utility rejection 
for the claimed antibodies is not proper. Appellants further submit that because a utility rejection 
for the claimed antibodies is not proper, the Examiner's enablement rejection of the claimed 
antibodies also is not proper. 

The Examiner appears to take a different position on this matter. In response to 
Appellants' citation of In re Brana as holding that usefulness in patent law necessarily includes 
the expectation of further research and development, the Examiner stated: 
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The argument has been fully considered, but is not persuasive. While Brana did 
deal with a rejection under 35 USC 1 12, first paragraph, the rejection was direct 
[sic] toward utility-specific, substantial and credible use-instead of enablement. 
While it is true that administration of a pharmaceutical to a human is not always 
necessary for either utility or enablement, one must know how to use the 
invention without undue experimentation. In the instant situation, Applicants 
claim a polypeptide which is at least 95% identical to SEQ ID NO:78, which it is 
maintained the disclosure does not enable the use of. Office Action dated June 22, 
2005, at page 3. 

Thus, the Examiner acknowledges Brana considered the propriety of a rejection under 35 U.S.C. 

§112, first paragraph, by evaluating whether or not the claimed subject matter had a utility. 

However, the Examiner appears to take the position that Brana is not applicable to the present 

enablement rejection because Brana was directed to a finding of a lack of utility. Appellants 

submit that Brana is applicable to the present case because the instant enablement rejection is 

based on the assertion that one would not have known how to use a polypeptide that lacked 

utility, and, thus, very closely parallels the rejection at issue in Brana. Thus, in accordance with 

Brana, Appellants submit that because one skilled in the art would have recognized the utility of 

the claimed antibodies (as discussed above), one skilled in the art would have knovra how to use 

the claimed antibodies. 

In the Office Action mailed June 22, 2005, interspersed between sections directed to the 

utility rejection, the Examiner included a section discussing the factors to be considered in 

determining whether a disclosure meets the enablement requirement as described in In re Wands, 

858 F.2d 731, 8 U.S.P.Q. 2d 1400 (Fed. Cir. 1988), in evaluating whether Appellants' asserted 

utility would have enabled one skilled in the art to use the claimed antibodies. Briefly, the 

Examiner's "Wands factors" analysis is based on arguments already used in asserting that the 

claims lack utility: the protein did not have an art-recognized use, there is evidence that nucleic 

acid expression does not correlate with protein expression, and the specification does not provide 

sufficient experimental details. Furthermore, by interspersing arguments directed toward lack of 

utility with arguments directed toward lack of enablement, the Examiner demonstrates that the 

enablement rejection is based on lack of utility grounds. If the enablement rejection were to be 

based on grounds other than lack of utility, the utility and enablement rejections should have 

been imposed separately according to the M.P.E.P., which admonishes: 

To avoid confiision during examination, any rejection under 35 U.S.C. 112, first 
paragraph, based on grounds other than "lack of utility" should be imposed 
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separately from any rejection imposed due to "lack of utility" under 35 U.S.C. 
101 and 35 U.S.C. 1 12, first paragraph. M.P.E.P. § 2107.01 IV, 

The Examiner did not separate the enablement rejection from the utility rejection. Moreover, the 
Examiner's "Wands factors" analysis is not based on any argument or evidence not already used 
by the Examiner in holding that the claims lack utility. As such, the Examiner's "Wands factors" 
analysis is grounded on a "lack of utility" basis; accordingly, the Examiner's enablement 
rejection is only proper if the utility rejection is proper. 

In view of the above. Appellants submit that if the claimed antibodies are determined to 
have a substantial, specific and credible utility, the enablement rejection of the claims is 
improper because the enablement rejection is grounded on the lack of utility of the claimed 
antibodies. Appellants submit that in the above sections Appellants have demonstrated that one 
skilled in the art would have believed that the claimed antibodies have a substantial, specific and 
credible utility. Accordingly, Appellants submit that the enablement rejection of these claims is 
improper. 

5. Appellants ' Specification Teaches How to Make and Use the Claimed Subject 
Matter 

Leaving aside Appellants' argument that the scope of the enablement rejection extends no 
further than the utility rejection, Appellants submit that the specification enables one skilled in 
the art to make and use the full scope of the claims without undue experimentation. The claimed 
subject matter relates to antibodies that specifically bind the polypeptide of SEQ ID NO: 78. 
The specification discloses how to make the claimed antibodies, for example in paragraphs 
[0365]-[0374] and Example 10, Similar methods also were known in the art. In addition, the 
specification discloses that the claimed antibodies can be used in diagnostic assays to detect the 
expression of PR01357 in specific types of tissue. See e.g.. Specification at TI[0407]. In light of 
the differential expression of the nucleic acid encoding the PR01357 polypeptide in lung and 
stomach tumors compared to normal lung and stomach, respectively, one of skill in the art would 
have expected the PR01357 polypeptide to be differentially expressed in these tumors as well. 
Therefore, given the teaching in the specification on how to make and use the claimed antibodies 
to detect expression of PR01357 polypeptide in specific tissues, one of skill in the art would 
have been enabled to practice the claimed invention without undue experimentation. 
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Because Appellants' specification teaches how to make and use the claimed subject 
matter, it must be taken as being in compliance with the enablement requirement unless there is a 
reason to doubt the objective truth of the statements contained therein which are relied on for 
enabling support. See M.P.KP. §2164.04. 

6. The Examiner's Arsuments and Appellants' Responses 

As mentioned above, the Examiner has relied on the factors to be considered in 
determining whether a disclosure meets the enablement requirement as described in In re Wands, 
858 F.2d 731, 8 U.S.P.Q. 2d 1400 (Fed. Cir. 1988). The Examiner's arguments are solely 
directed at the "use" prong of the enablement question. Despite Appellants' position that the 
Examiner's "Wands factors" analysis is based on the asserted lack of utility, Appellants 
nevertheless reply to the Examiner's arguments regarding each "Wands factor" in tum below. 

a. The breadth of the claims 

The Examiner asserts that the "breadth of the claims is broad, encompassing structural 
variation." Office Action dated June 22, 2005, at page 4. No other reasoning is provided. 
Appellants submit that not all claims that encompass a variation in structure are per se broad (see, 
e.g, In re Wands), and that the presently claimed antibodies are members of a finite genus of 
highly related polypeptides with common activities, i.e., the ability to specifically bind to the 
polypeptide of SEQ ID NO:78. 

The Examiner fails to indicate why a claim encompassing the recited antibodies 
constitutes a "broad" claim. Further, this finding is in contrast to precedent of the Federal Circuit 
which has not held that claims directed to antibodies with designated binding properties are 
unduly broad (see, e.g, In re Wands). 

Appellants submit that the scope of the claims weighs in favor of enablement. The 
Examiner provides no reasoning to conclude otherwise. 

b. The nature of the invention 

The Examiner argues that the nature of the invention is a polypeptide without a 
"recognized/characterized physiological/biochemical property." 
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The nature of the invention is an antibody. Antibodies have been known in the art for 
many years, and methods for making, assaying and using antibodies are well established. As 
discussed below, Appellants' specification provides the specific teachings relevant to the claimed 
antibodies including, for example, use for diagnostic purposes. 

Thus, contrary to the Examiner's assertions, the nature of the invention weighs in favor of 
enablement as it is a predictable and well-established aspect of biology, and Appellants' 
specification provides specific teachings regarding the claimed antibodies. 

c. The state of the prior art 

Regarding the state of the prior art, the Examiner states that the connection of SEQ ID 
NO: 77 to tumors was not known, and that the relationship of PRO 1357 to a family of proteins 
was not known. 

Appellants acknowledge the Examiner's statements toward the novelty of the claimed 
antibodies. 

Regarding the state of the prior art, however. Appellants submit that methods of making 
antibodies such as the claimed antibodies were well known in the art, and methods of using 
antibodies for detecting polypeptides such as the PRO 1357 polypeptide in tissue samples were 
well known in the art. As discussed above in relation to utility, the Appellants' specification has 
provided teachings regarding differential expression of mRNA encoding the PRO 1357 
polypeptide in tumors which, based on the state of the prior art, is sufficient to indicate that the 
PRO 1357 polypeptide is differentially expressed in tumors. Thus, the prior art's lack of specific 
teachings of PR01357 is moot for purposes of enablement of the claimed antibodies. 

Because methods of making and using polypeptides and methods of making and using 
antibodies that bind particular polypeptides were well known in the prior art, this factor weighs 
in favor of enablement. 

d. Level of skill in the art 

The Examiner states that the skill in the art of differential nucleic acid screening has 
existed for over a decade, but depends on "relative or absolute levels of the difference(s), the 
ability to generalize to more than one cell culture or tumor type or, conversely, the ability to 
pinpoint a particular tumor type (e.g., adenocarcinoma versus squamal), and repeatability of the 
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differential expression both in terms of frequency/pre valence and quantity/sensitivity." Office 
Action dated June 22, 2005, at page 4. No evidence is provided to support this characterization 
of the skill in the art. 

Appellants submit that the level of skill in the art of differential nucleic acid screening is 
not relevant to a determination of whether one skilled in the art would have been able to use the 
claimed antibodies. The Examiner does not focus on whether one skilled in the art would have 
been able to use the claimed antibodies, but instead whether Appellants' differential nucleotide 
expression data is credible, substantial and specific enough to establish that one of ordinary skill 
would have expected an assay using the claimed antibodies to yield a useful experimental result. 
As discussed above, this ground for rejection is consistent with In re Brana and is based on lack 
of utility grounds. Furthermore, the Examiner acknowledges the utility of the differentially 
expressed nucleic acid encoding the PRO 1357 polypeptide. Thus, in view of the Examiner's 
finding of utility for the differentially expressed nucleic acid encoding the PRO 1357 polypeptide, 
this basis for rejecting the claims as lacking enablement is moot. 

Even if the Examiner's statements regarding differential nucleic acid screening were 
relevant, the Examiner makes assertions of fact not supported by any evidence. The unsupported 
assertion of fact regarding experimental specifics upon which differential nucleic acid screening 
depends represents a statement by official notice. See, e.g.. In re Zurko, 258 F.3d 1379, 1385, 59 
U.S.P.Q.2d 1693, 1697 (Fed. Cir. 2001); and In re Ahlert, 424 F.2d 1088, 1091, 165 U.S.P.Q. 
418, 420 (CCPA 1970). Appellants submit that the experimental specifics for differential 
nucleic acid screening asserted by official notice are not well known or capable of instant and 
unquestionable demonstration as being well-known. Thus, in accordance with M.P.E.P. 
§ 2144.03C, the Examiner must provide documentary evidence demonstrating such asserted facts. 
In the absence of such evidence, this factual assertion cannot be established. 

The Examiner further states that "there is evidence in the prior art that even for those 
nucleic acids differentially expressed in tumors, a correlated expression for the encoded protein 
is not a given." Office Action dated June 22, 2005, at page 4. Appellants presume the Examiner 
is referring to the reference by Haynes. Appellants again submit that this ground for rejection is 
based on utility, and thus, is consistent with In re Brana. Furthermore, Appellants have provided 
evidence and arguments in the utility discussion above demonstrating that one or ordinary skill in 
the art would have expected that a change in mRNA expression levels would typically lead to a 
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corresponding change in the level of the encoded polypeptide. Thus, in view of Appellants' 
previously presented evidence and arguments, this basis for rejecting the claims as lacking 
enablement is moot. 

Appellants submit that the level of skill in the art was high for methods for using 
antibodies to detect polypeptides, as taught in the specification, for example, at paragraph [0407] 
and references cited therein. The Examiner provides no reason to conclude otherwise. 

Furthermore, Appellants submit that, contrary to the Examiner's assertions, one of 
ordinary skill in the art would have expected that an assay using the claimed antibodies would 
yield a useful experimental result. Appellants have responded at length to the Examiner's 
arguments in the utility discussion above, and refer the Board to the discussion of the utility 
rejection supra. 

In view of the high the level of skill in the art for methods of using antibodies to detect 
polypeptides, and in view of the credible, substantial and specific teachings of differential 
PRO 1357 polypeptide expression in lung and stomach tumor relative to normal lung and 
stomach, respectively, the level of skill in the art for using the claimed antibodies to yield a 
useful experimental result was high. Accordingly, this factor weighs in favor of enablement. 

e. Level of predictabiliy in the art 
The Examiner does not assert that predictability in the art would suggest a lack of 
enablement of the claimed antibodies. Appellants presume the Examiner to have intended to 
treat the level of skill in the art and predictability in the art together. Thus, Appellants submit 
that, in view of the above discussion, one skilled in the art would have been able to predictably 
use the claimed antibodies in diagnostic methods. As such, this factor weighs in favor of 
enablement. 

/ Guidance in the specification 
The Examiner states: 

There is very little guidance or direction about using the claimed polypeptide 
except that the encoded nucleic acid of SEQ ID NO: 77 is underexpressed in 
stomach and lung tumors. As discussed in previous Office actions, the specific 
type of tumor is not disclosed, nor are levels of expression, relative amounts or 
how many different tumor cDNA libraries from each tumor tissue were screened, 
for example." Office Action dated June 22, 2005, at page 4. 
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Again, Appellants submit that the statements of the Examiner are directed to the 
disclosed differential expression of nucleic acid encoding the PRO 1357 polypeptide, and attempt 
establish a lack of enablement based on the grounds of a lack of utility. 

The specification provides ample guidance to one skilled in the art for the use of the 
claimed antibodies. As discussed above in relation to the utility rejection, the specification 
teaches that the polypeptide of SEQ ID NO: 78 (referred to as "PRO 1357 polypeptide") is 
encoded by the polynucleotide of SEQ ID NO: 77 (also referred to as DNA64881-1602). 
Specification at Tflj [0103-0104]. In "Example 18: Tumor Versus Normal Differential Tissue 
Expression Distribution" Appellants disclose that the mRNA encoding PRO 1357 polypeptide is 
more highly expressed in normal stomach and lung tissue compared to stomach and lung tumor, 
respectively. Specification at [0529]-[0530] and accompanying tables. As explained in 
paragraph [0530], the differential expression of the PR01357 mRNA was detected using the 
well-established technique of quantitative PGR amplification of cDNA libraries isolated from 
different human normal and tumor tissue samples. To ensure that equivalent amounts of nucleic 
acid were used in each reaction, the cDNA for P-actin was used as a control. 

The specification teaches that identification of the differential expression of a PRO 
polypeptide-encoding mRNA in one or more tumor tissues as compared to one or more normal 
tissues of the same tissue type "renders the molecule useful diagnostically for the determination 
of the presence or absence of tumor in a subject suspected of possessing a tumor." Specification 
at \ [0530]. The Examiner has recognized that the teachings of differential expression of the 
PR01357 mRNA in the specification are sufficient to establish a utility for the nucleic acid 
encoding PRO 1357 polypeptide: "[I]t is agreed that the polynucleotide of SEQ ID NO:77 has 
this specific utility." Office Action dated June 22, 2005, at page 10. 

Appellants have discussed above that it was well established in the art that changes in 
mRNA levels lead to changes in the level of the encoded protein, and the specification need not 
teach this relationship. Thus, Appellants submit that, based on the teachings of the specification 
and the knowledge in the art, one would have expected the PRO 1357 protein to be differentially 
expressed in stomach and lung tumors. 

The specification states that PRO polypeptides "may also be used diagnostically for 

tissue typing, wherein the PRO polypeptides of the present invention may be differentially 

expressed in one tissue as compared to another, preferably in a diseased tissue as compared to a 
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normal tissue of the same tissue type." Specification at [0336]. The specification also 

discloses that PRO polypeptides and polypeptides related thereto can be used to generate anti- 

PRO antibodies. Id. at ^ [0364] and ^ [0367]. The specification teaches that such antibodies to 

PRO polypeptides can be as diagnostic tools: 

[A]nti-PRO antibodies may be used in diagnostic assays for PRO [polypeptide], 
e.g., detecting its expression (and in some cases, differential expression) in 
specific cells, tissues, or serum. Various diagnostic assay techniques known in 
the art may be used, such as competitive binding assays, direct or indirect 
sandwich assays and immunoprecipitation assays conducted in either 
heterogeneous or homogeneous phases. Specification at \ [0407]. 

Taken together, the specification provides substantial guidance regarding the use of the 
claimed antibodies as tools in diagnosing cancer, particularly stomach and lung cancer. 

The Examiner argues that the specification provides "very little guidance or direction" 
because various experimental specifics such as "specific type of tumor," "levels of expression, 
relative amounts or how many different tumor cDNA libraries from each tumor were screened" 
were not provided in the specification. No evidence is provided to indicate why such 
information must be disclosed. There is no requirement that a specification teach a level of 
experimental details that would obviate the need for any experimentation. Instead, 
"[e]nablement is not precluded by the necessity for some experimentation such as routine 
screening." In re Wands, 858 F.2d 731, 736-7, 8 U.S.P.Q. 2d 1400. The Examiner provides no 
evidence or reasoning for why absence of such experimental specifics would require more than 
routine screening for one skilled in the art to use the claimed antibodies. Accordingly, there is 
no factual basis for concluding that the specification provides "very little guidance or direction." 

Despite the lack of evidentiary support for the Examiner's position, Appellants submit 
that any additional experimental specifics represent routine experimental details that are readily 
determined by one skilled in the art in accordance with the knowledge in the art and the 
teachings of the specification. Appellants' specification identifies the PRO 1357 polypeptide, 
teaches methods for preparing antibodies to the PRO 1357 polypeptide, teaches methods for 
using such antibodies, and teaches particular organs to target in methods of using such antibodies. 
Specific experimental details for the use of the polypeptides and antibodies can be determined 
according to the methods that were well known in the art. 
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Thus, contrary to the Examiner's assertions, the specification provides sufficient 
guidance on how to use the claimed antibodies such that this factor weighs in favor of 
enablement. 

g. Workins examples 

The Examiner does not assert that the application does not provide sufficient working 
examples. Appellants' working example demonstrates that the nucleic acid encoding the 
PRO 1357 polypeptide is differentially expressed in stomach and lung tumor tissue relative to 
normal stomach and lung tissue, respectively. 

In "Example 18: Tumor Versus Normal Differential Tissue Expression Distribution" 
Appellants disclose that the mRNA encoding PRO 1357 polypeptide is more highly expressed in 
normal stomach and lung tissue compared to stomach and lung tumor, respectively. 
Specification at [0529]-[0530] and accompanying tables. As explained in paragraph [0530], 
the differential expression of the PRO 1357 mRNA was detected using the well-established 
technique of quantitative PGR amplification of cDNA libraries isolated from different human 
normal and tumor tissue samples. To ensure that equivalent amounts of nucleic acid were used 
in each reaction, the cDNA for P-actin was used as a control. 

Appellants submit that, in view of differential expression disclosed in Example 18 and 
the knowledge in the art, one skilled in the art would also have expected that the PRO 1357 
polypeptide was differentially expressed. As such, Appellants submit that the disclosure in 
Example 1 8 weighs in favor of enablement. 

h. The Quantity of experimentation 

The Examiner makes no assertion regarding the quantity of experimentation required to 
use the claimed antibodies. 

Appellants submit that the claimed antibodies can be used as diagnostic tools for cancer 
by detecting the differential expression of the PRO 1357 polypeptide in stomach and lung tumors 
relative to normal stomach and lung tissue, respectively. Appellants submit that methods of 
using antibodies such as the claimed antibodies were well-established in the art. Further, the 
specification teaches particular organs to target in methods of using such antibodies. 
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Thus, one skilled in the art would have known how to make and use the claimed 
antibodies, and would have known particular organs to target in detecting PRO 1357. As such, 
experiments could have readily been designed to directly target PRO 13 57, with no more than 
routine amounts of experimentation required. Because the quantity of experimentation is low, 
this factor weighs in favor of enablement. The Examiner provides no reason to conclude 
otherwise. 

/. Wands Factors - Conclusion 

Appellants submit that in view of the well-defined scope of the claims, the nature of the 
claimed invention, the state of the prior art, the high level of skill in the art, the teachings of the 
specification, and the quantity of experimentation required to use the claimed subject matter, one 
skilled in the art could have used the claimed antibodies without undue experimentation. 

In the Examiner's entire analysis of the Wands factors, the Examiner points only once to 
evidence. The Examiner states that there is evidence in the prior art that even for nucleic acids 
differentially expressed in tumors, a correlated expression for the encoded protein is not a given. 
Appellants presume the Examiner to be referring to the publication by Haynes, which is 
discussed in more detail above in regard to the utility rejection. Appellants have established that 
the totality of the record demonstrates that one skilled in the art would be more likely than not to 
expect a change in the level of mRNA to generally lead to a corresponding change in the level of 
the encoded polypeptide. Thus, the totality of the record does not support the Examiner's 
characterization of the knowledge in the art. Accordingly, in view of the teachings in the 
specification and the knowledge in the art, as exemplified in evidence submitted by Appellants 
and the Examiner, one skilled in the art would have expected the PRO 1357 polypeptide to be 
differentially expressed. 

The Examiner argues that the specification is insufficient because various experimental 
specifics were not provided in the specification, and without disclosing such specifics, it would 
require undue experimentation to use the claimed antibodies. The fact that experimentation may 
be complex does not necessarily make it undue, if the art typically engages in such 
experimentation. M.P.E.P. § 2164.01; re Certain Limited-Charge Cell Culture Microcarriers, 
221 U.S.P.Q. 1165, 1174 (Int'l Trade Comm'n 1983), ajf'd sub nam., Massachusetts Institute of 
Technology v. A.B, Fortia, 774 F.2d 1104, 227 U.S.P.Q. 428 (Fed. Cir. 1985). See also In re 
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Wands, 858 F.2d at 737, 8 U.S.P.Q.2d at 1404. The test of enablement is not whether any 
experimentation is necessary, but whether, if experimentation is necessary, it is undue. In re 
Angstadt, 537 F.2d 498, 504, 190 U.S.P.Q. 214, 219 (CCPA 1976). The test is not merely 
quantitative, since a considerable amount of experimentation is permissible, if it is merely 
routine, or if the specification in question provides a reasonable amount of guidance with respect 
to the direction in which the experimentation should proceed. In re Wands, 858 F.2d at 737, 
8 U.S.P.Q.2d at 1404, citing In re Jackson, 217 U.S.P.Q. 804, 807-808 (Bd. App. 1982). Based 
on the teachings of the specification and the level of skill in the art, it was routine to make and 
use antibodies such as the claimed antibodies, in, for example, detecting proteins in designated 
tissue samples. No undue experimentation was required for a Ph.D. scientist with several years 
of experience to use these routine methods, in view of the teachings in the specification, in order 
to determine details such as the binding properties of the claimed antibodies, the ability of an 
antibody to bind to a sample, or specific details of sample binding. Accordingly, it would not 
have required undue experimentation for one skilled in the art to make and use the claimed 
antibodies. The claimed invention is, therefore, fully enabled. Moreover, the Examiner provides 
no evidence to support an assertion that, absent various specific experimental details, it would 
require undue experimentation to use the claimed antibodies. Absent such evidence, there is no 
reasonable basis to question the sufficiency of the disclosure. 

In view of the above, Appellants submit that the Wands factors support a finding that the 
specification, in view of the knowledge in the art, fully enabled the use of the claimed antibodies. 
The Examiner has provided no significant evidence or argument to the contrary. 

7. Enablement - Conclusion 

The Examiner's enablement rejection is grounded on a "lack of utility" basis, and, thus, 
the present enablement rejection is only proper if the utility rejection is proper. Appellants have 
argued above that one skilled in the art would have believed that the claimed antibodies have a 
substantial, specific and credible utility, and, thus, a utility rejection for the claimed antibodies is 
not proper. Appellants therefore submit that because a utility rejection for the claimed antibodies 
is not proper, the Examiner's enablement rejection of the claimed antibodies also is not proper. 

Furthermore, for the reasons discussed above, all of the In re Wands factors weigh in 
favor of a finding of enablement. The Examiner has made no attempt to argue that one of skill in 

-59- 



Appl. No. : 10/063,587 

Filed : May 3, 2002 

the art would be unable to make the claimed antibodies. The Examiner's arguments are 
essentially the same as those made in support of the utihty rejection, which fail for the reasons 
articulated above. Furthermore, the Examiner submits no significant evidence to support a 
finding of undue experimentation. Therefore, the Examiner has failed to meet the initial burden 
of establishing a reasonable basis to question the enablement provided for the claimed invention. 
SeeM,P.KP. §2164.04. 

And even if the Examiner has met her initial burden, Appellants have pointed to specific 
teachings in the specification and have presented the evidence discussed above with respect to 
utility, that one skilled in the art would be able to make and use the claimed invention using the 
application as a guide. Appellants remind the Board that "[t]he evidence provided by applicant 
need not be conclusive but merely convincing to one skilled in the art." M.P.E.P, § 2164.05 
(emphasis in original). 

Considering all of the evidence provided by the Appellants to establish their asserted 
utility, along with the disclosure in the specification, the Board should find that Appellants have 
established that one of skill in the art would be able to make and use the claimed invention 
without undue experimentation, and the Examiner's rejection of the pending claims as lacking an 
enabling disclosure should be reversed. 

D. Anticipation Rejection 

The third issue before the Board is whether the claimed subject matter is anticipated 
under 35 U.S.C. § 102(b) over WO 01/16318 (published March 8, 2001) or WO 00/12708 
(published March 9, 2000). 

The Examiner asserts that "[bjecause the claims do not meet the requirements of 35 
U.S.C. § 112, first paragraph, ... and the earlier application[s] likewise do not meet those 
requirements, the instant application does not receive the benefit of priority to earlier filed 
applications." Office Action dated June 22, 2005, at page 11. 

Appellants have previously listed the priority information for the instant application in a 
Preliminary Amendment mailed September 3, 2002. The preliminary amendment states that the 
instant "application is a continuation of, and claims priority under 35 U.S.C. § 120 to, US 
Application 10/006867 filed 12/6/2001, which is a continuation of, and claims priority under 35 
U.S.C. § 120 to, PCT Application PCT/USOO/23328 filed 8/24/2000, which is a continuation-in- 
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part of, and claims priority under 35 U.S.C. § 120 to, US Application 09/403297 filed 
10/18/1999, now abandoned, which is the National Stage filed under 35 U.S.C. § 371 of PCT 
Application PCT/US99/20111 filed 9/1/1999, which claims priority under 35 U.S.C. § 119 to 
U.S. Provisional Application 60/099741 filed 9/10/1998." 

The sequences of SEQ ID NOs: 77 and 78 were first disclosed in U.S. Provisional 
Application 60/099,741 filed 9/10/1998 in Figures 1 and 2. The data in Example 18 (Tumor 
Versus Normal Differential Tissue Expression Distribution), relied on in part for the utility of the 
claimed antibodies, were first disclosed in PCT Application PCT/USOO/23328 filed 8/24/2000, 
on page 93, line 3, through page 96, line 35. 

Appellants submit that, in view of the arguments above, the claimed antibodies have 
utility and are fully supported by the specification in accordance with 35 U.S.C. § 112, first 
paragraph. Moreover, Appellants submit that the previously filed applications, to which 
Appellants have properly claimed priority, also support the claimed antibodies. Even if it were 
to be determined that Appellants are not entitled to their earliest priority date, the subject matter 
of the present application was disclosed in, and therefore is entitled to the priority date of, PCT 
Application PCT/USOO/23328 filed August 24, 2000. Accordingly Appellants are entitled to a 

« « 

priority date no later than August 24, 2000. 

WO 01/16318 was published March 8, 2001. Thus, WO 01/16318 was not published 
more than one year prior to Appellants' priority date, as required under 35 U.S.C. § 102(b). 
Accordingly, WO 01/16318 cannot be prior art under 35 U.S.C. § 102(b). 

Similarly, WO 00/12708 was published March 9, 2000. Thus, WO 00/12708 was not 
published more than one year prior to Appellants' priority date, as required under 35 U.S.C. 
§ 102(b). Accordingly, WO 00/12708 cannot be prior art under 35 U.S.C. § 102(b). 

In conclusion. Appellants submit that the present disclosure fully supports the claims 
under 35 U.S.C. §§ 101 and 1 12, first paragraph. Further, Appellants disclosure is entitled to a 
priority date of no later than August 24, 2000. Accordingly, the cited references, which were not 
published more than a year prior to August 24, 2000, are not properly prior art under 35 U.S.C. 
§ 102(b). Hence, Appellants respectfully request that the Board reverse the Examiner's 
anticipation rejection of Claims 1-5 under 35 U.S.C. § 102(b). 
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E. Conclusion 

In view of the arguments presented above. Appellants submit that the specification as 
filed provides a specific, substantial and credible utility for the claimed antibodies and request 
withdrawal of the rejection under 35 U.S.C. § 101, and the related rejection under 35 U.S.C. 
§112. Appellants further submit that the claimed antibodies are fully enabled by the 
specification, and request withdrawal of the remaining rejection under 35 U.S.C. § 1 12. Finally, 
Appellants submit that the claimed antibodies are not anticipated by the cited references and 
accordingly request withdrawal of the rejection under 35 U.S.C. § 102(b). 

Please charge any additional fees, including any fees for additional extension of time, or 
credit overpayment to Deposit Account No. 1 1 - 1 4 1 0. 



Respectfully submitted, 



KNOBBE, MARTENS, OLSON & BEAR, LLP 




ArmeMarie Kaiser ( 
Registration No .^,649 
Attonjey of Record 
Customer No. 30,3 1 3 
(619) 235-8550 
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VIIL APPENDIX A - CLAIMS ON APPEAL 




1. (Previously presented) An isolated antibody that specifically binds to the 
polypeptide of SEQ ID NO: 78. 

2. (Original) The antibody of claim 1 which is a monoclonal antibody. 

3. (Original) The antibody of claim 1 which is a humanized antibody. 

4. (Original) The antibody of claim 1 which is an antibody fragment. 

5. (Original) The antibody of claim 1 which is labeled. 

6. (Canceled) 
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IX. APPENDIX B - EVIDENCE 

Attached hereto is a copy of the evidence cited ,in Appellants' Brief. The list of evidence 
below is accompanied by a statement setting forth where in the record that evidence was entered 
into the record by the Examiner. 



Tab 


Reference 


Submitted 


Entered 


1 

J. 


Fir^t Dpnlfirfition nf T 


Submitted as Rxhibit 1 with 


Not indicated as entered bv the 




Christopher Grimaldi 


Appellants' Amendment and 
Response to Office Action 
mailed Seotember 22 2004 


Examiner, but properly should 
have been entered as a matter 
of rieht in accordance with 37 
C.F.R. § 1.111 


2 


Second Declaration by 


Submitted as Exhibit 2 with 


Not indicated as entered by the 




T r^hri^tonhpr Crrimaldi 


Annellants' Amendment and 

Response to Office Action 
mailed September 22, 2004 


Examiner but nrooerlv should 

X^^mrV%X X XXXXX^X ^ L/%^V ^^XX^L^^XX ▼ kJXAVr vxxvx 

have been entered as a matter 
of right in accordance with 37 
C.F.R. § 1.111 


3 


Declaration of Paul 


Submitted as Exhibit 3 with 


Not indicated as entered by the 




Polakis, Ph.D. 


Appellants' Amendment and 
Response to Office Action 
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Baton, et al. 
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SECRETED AND 
TRANSMEMBRANE 
POLYPEPTIDES AND NUCLEIC 
ACIDS ENCODING THE SAME 
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David J. Blanchard 



Group Art Unit : 1 642 
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Commissioner for Patents 
P.O. Box 1450 

Alexandria, VA 22313-1450 



Dear Sin 



I, J. Christopher Grimaldi, declare and state as follows: 



1. I am a Senior Research Associate in the Molecular Biology Department of 
Genentech, Inc., South San Francisco, CA 94080. 

« 

■ < 

and fnr^; ^ ^if^^^^T ^.^"^^^^ "^'"^^^ ^<'J^ding my list of publications, is attached .to 
and forms part of this Declaration (Exhibit A). 

3. 1 joined Genentech in January of 1999. From 1999 to 2003, 1 directed the Clonine 
Laboratory m the Molecular Biology Department During this time I directed or perfonned 
i^imerous molecular biology techniques including semiquantitative Polymerase Chain Reaction 
(PCR) analyses. I am cuixently involved, among other projects, in the isolation of genes coding 
for membrane associated proteins which can be used as targets for antibody therapeutics againsi 
cancer n comiection with the above-identified patent application, I personally performed or 

No^^^fi^r^'lTf "'^P'"^'^" ^"^y^^ ^" *^ ^^^y "Tumor Versus 

Normal Differential Tissue Expression Distribution," which is described in EXAMPLE 18 in the 

t w ^ • ^^'^ ^"^'^ "^^'^ ''^""^'^ differences in gene expression between tumor 

tissue and their noraial counteiparts. 

4 EXAMPLE 1 8 reports the results of the PCR analyses conducted as part of the 
investigating of several newly discovered DNA sequences. This process included developing 
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primers and analyzing expression of the DNA sequences of interest in normal and tumor tissues. 
The analyses were designed to determine whether a difference exists between gene expression in 
normal tissues as compared to tumor in the same tissue type. 

5. The DNA libraries used in the gene expression studies were made from pooled 
samples of normal and of tumor tissues. Data from pooled samples is more likely to be accurate 
than data obtained from a sample from a single individual. That is, the detection of variations in 
gene expression is likely to represent a more generally relevant condition when pooled samples 
from normal tissues are compared with pooled samples from tumors in the same tissue type, 

6. In differential gene expression studies, one looks for genes whose expression levels 
differ significantly under different conditions, for example, in normal versus diseased tissue. 
Thus, I conducted a semi- quantitative analysis of the expression of the DNA sequences of 
interest in normal versus tumor tissues. Expression levels were graded according to a scale of +, - 
, and +/- to iijdicate the amount of the specific signal detected. Using the widely accepted 
technique of PGR, it was determined whether the polynucleotides tested were more highly 
expressed, less expressed, or whether expression remained the same in tumor tissue as compared 
to its normal counterpart- Because this technique relies on the visual detection of ethidium 
bromide staining of PGR products on agarose gels, it is reasonable to assume that any detectable 
differences seen between two samples will represent at least a two fold difference in cDNA. 

7. The results of the gene expression studies indicate that the genes of interest can be 
used to differentiate tumor from normal. The precise levels of gene expression are irrelevant; 
what matters is that there is a relative difference in expression between normal tissue and tumor 
tissue. The precise type of tumor is also irrelevant; again, the assay was designed to indicate 
whether a difference exists between normal tissue and tumor tissue of the same type. If a 
difference is detected, this indicates that the gene and its corresponding polypeptide and 
antibodies against the polypeptide are useful for diagnostic purposes, to screen samples to 
differentiate between normal and tumor. Additional studies can then be conducted if further 
information is desired. 



8. I hereby declare that all statements made herein of my own knowledge are true and 
that all statements made on information or belief are believed to be true, and fixrther that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 1 8 of the United States 
Gode and that such willful statements may jeopardize the validity of the application or any 
patent issued thereon. 
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The t(5;14) Chromosomal Translocation in a Case of Acute Lymphocytic 
^Leukemia Joins the Interleukin-3 Gene to the Immunoglobulin Heavy Chain Gene 

By J. Christopher Grimaldi and limothY C. Meeker 



Qfiromosomal translooations have proven to be important 
nrverkers of the genetio abnormalities central to the patho- 
genesis of cancer. By cloning chromosomal breakpoints 
or^e can identify activated proto-oncogenes. We have stud- 
a case of B-lineage acute lymphocytic leukemia (ALL) 
theat was associated with peripheral blood eosinophilla. The. 
cfiromosomal translocation t(6;14) (q31:q32) from this 
s^miplo was cloned and studied at the molecular leveL This 

KARYOTYPIC STUDIES of leukemia and lymphoma 
have identiiied frequent nonrandom chromosomal 
translocaUons. Some of these translocations juxtapose the 
irmmunoglobulin heavy chain (IgH) gene with important 
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Fig 1. DNA blots of the leukemia sample. The restriction 
fragment pattern of normal human DNA (N) and the leukemia 
sample (L) were compared using a human Jh probe. Rearranged 
bands are indicated by arrows. Sample L exhibits a single rear* 
ranged band with both lll/fcoRI and 5aii3A restriction 
digests. The rearranged bands are less intense than the other 
bands because the majority of cells in the sample represent normal 
bone marrow elements. 



tranislooation Joined the immunogtobulin heavy chain Join- 
ing Uh) region to the promoter region of the interleukin-3 
<IU3) gene in opposite transcriptional orientations. The 
data suggest that acth/ation of the IL-3 gene by the 
enhancer of the Immunoglobulin heavy chain gene may play 
a central role In the pathogenesis of tliis iaulcemia and the 
associated eosinophilla. 
o 1989 by Grune & Stratton, Inc. 

m 

« 

protooncogenes, such as cmtyc and bcl-l?"^ In this way, the 
IgH gene can activate protooncogenes, resulting in disor- 
dered gene expression and a step in the development of 
cancer. The investigation of additional nonrandom transloca- 
tions into the IgH locus allows us to identify new genes 
promoting the generation of leuicemia and lymphoma. 

A distinct subtype of acute lymphocytic leukemia (ALL) 
has been characterized by B*>lineage phenotype, associated 
eosinophilia in the peripheral blood, and a t(5;14)(q31;q32) 
chromosomal translocation.^'^ This syndrome probably 
occurs in <1% of all patients with ALL. We hypothesized 
that the cloning of the translocation characteristic of this 
leukemia might allow the identification of an important gene 
on chromosome S that plays a role in the evolution of this 
disease. In this report we demonstrate that the interleukin-3 
gene (IL-3) and the IgH gene are Joined by this transloca- 
tion. 

MATERIALS AND METHODS 

Sample and DNA blots. A bone marrow aspirate from a repre- 
sentative patient with ALL (LI morphology by Frencb-Amcrican- 
Britlsb [FAB] criteria), peripheral eosinophilia (up lo 20,000 per 
microliter with a normal value of <350 per microliter) and a 
t(S;l4)(q3l;q32) translocation was studied. Using published meth- 
ods, genomic DNA was isolated and DNA blots were raade.^ Briefly. 
10 Mg of high molecular wdght (mol wt) DNA were digested using 
an appropriate restriction enzyme and elcarophorcscd on a 0,8% 
'agttVbse'gcL The gel was stained with ethidium bromide, photo- 
graphed, denatured, neutralized, and transferred to Hybond (Amer- 
sham, Ariington Heights, IL). After treatment of the filter with 
ultraviolet light, hybridization was performed. The filter was washed 
lo a final stringency of 0.2% saturated sodium cilrale (SSC) and 
0.1% sodium lauryly sulfate (SDS) and exposed to film. The human 
Jb probe has been previously reported.^ 

Genomic library. The genomic library was made using pub- 

From the Division of Hematolo^/Oncology, Department of 
Medicine, University of California, San Francisco, 

Submitted February 22, 1989: accepted March 8, 1989. 

Supported by NIH Grant No. CA01102. 

Address reprint requests to Timothy C. Meeker, MD. UCSF/ 
VAMC iUH. 4150 Clement St, San Francisco, CA 9412L 

Dr Grimaldi*s current address is Biostan Inc, 440 Chesapeake 
Dr, Seaport Centre, Redwood City, CA 94063. 

The publication costs of this article were defrayed in part by page 
charge payment. This article must therefore be hereby marked 
"advertisemenl" In accordance with 1 8 U.S.C. section 1 734 solely to 
Indicate this fact. 

(» 1989 by Grune d Stratton, Inc. 

0006-4971/89/7308'003i$3.00/0 



Shod, Vol 73, No 8 Uune), 1989: pp 2081-2085 



2081 




o 



2082 



hshed methods.* Approximately 100 m8 of high mol wl genomic 
DNA were partially digested with the Sau3A restriction en^c 
Fragments from 9 to 23 kilobases (kb) in size were isolated on a 
sucrose gradient and Ugated faito phage EMBUA (Strategeno, San 
Diego). Reoombinant phage were packaged, plated, and screened as 
previovsly reported.* 

DNA sequencing. Fragments for sequencing were cloned into 
Ml 3 vectors and sequenced by the chain termination method using 
Sequenase (United States Biochemical, Cleveland),' All sequence 
data were derived from both strands. 

* 

RESULTS 

We studied a bone marrow sample from a patient with 
ALL and associated peripheral eosinophilia. Karyotypic 
analysis showed the characteristic t(5;14)(q31;q32) translo- 
cation. These features define a distinctive subtype of ALL.''* 
The leukemic cells were analyzed for cell surface phenotypc 
by immunofluorecence. They were positive for 81 (CD20) 
B4 (0519). cALLA (CDIO). HLA-DR, and terminal 
deoxynucleotidyl transferase (Tdt), but negative for surface 
immunoglobulin. This phenotypic profile describes an imma- 
ture cell from the B-lymphocytic lineage.' 

The leukemia DNA was analyzed by Southern blotting for 
rearrangements of the IgH gene. Using a human iramuno- 
globulin Jh probe, a single rearranged band was detected by 
EcoKl, Hindlll, Sstl, Sau3A, and EcoVCL plus Hindlll 
restriction digests, suggesting rearrangement of one allele 
(Fig I). The immunoglobulin Jh reg;ion from the other allele 
was presumably either deleted or in the germllne configura- 
tion. 

We hypothesized that the t(5;l4)(q31;q32) juxtaposed a 
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growth-promoUng gene on chromosome 5 with the immuno- 
globulin Jh region on chromosome 14. Therefore, a genomic 
library was made from the leukemic sample and screened 
with a Jh probe. Fifteen distinct positive clones were isolated 
and sweened for the presence of the rearranged Sau3A 
fragment that was detected by DNA blotting. By this 
analysis, five clones appeared to represent the rearranged 
allele identified by DNA blots. One of these clones (clone no. 
4) was chosen for further study and a detailed restriction 
map was generated. The EcoRl HindiU/BcoKl and Sstl 
fragments from clone no. 4 that hybridized to the human Jh 
probe were also identical in size to the rearranged fragments 
from the leukemia sample, confirming that clone no. 4 
represented the rearranged leukemic allele. 

Phage clone no, 4 contained 3.7 kb of unknown origin 
joined to the IgH gene in the region of Jh4 (Fig 2). The IgH 
gene from Jh4 to the Cmu region appeared to be in germline 
configuration. Previously, the gene encoding hematopoietic 
growth factor 11^3 had been mapped to chromosome 5q3 1 so 
it was suspected that clone no. 4 might contain part of this 
genc,*^" When the restriction map of human IL-3 and clone 

no. 4 were compared, they were identical for more than 3 kb 
(Fig 2). 

We confirmed the juxtaposition of the IL-3 gene and the 
IgH gene by nucleic acid sequencing of the subcloned 
BstEll/Hpal fragment (Fig 2). The sequence of this frag- 
ment showed no disruption of the protwn coding region or the 
messenger RN A of the ILO gene. The break in the IL-3 gene 
occurred in the promoter region, 452 base pairs (bp) 
upstream of the transcriptional start site (position 64, Fig 
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T(5; lO CHROMOSOMAL TRANSLOCATION 

• 

3 A.>. T^li^ break in the IgH gene occurred 2 bp upstream of 
the Jb4 region. Between the two breaks, 25 bp of uncertain 
origin (putative N sequence) were inserted J'*" No sequences 
homologous to the immunoglobulin heptamer and nonamer 
coiB.ld be identified In the IL-3 sequence (Fig 3B). Therefore, 
nuol&ic acid sequencing confirmed the juxtaposition of the 
gene and the IgH gene. The sequence data clearly 
shoved that the genes were positioned in opposite trans<7ip- 
tio^l orientations (head-to-head). 

v^vailable data also allowed us to determine the normal 
positions of the IL-3 gene and the GM-CSF gene in relation 
to 'the centromere of chromosome 5 (Fig 4). The IgH gene is 
knovsrn to be positioned with the variable regions toward the 
teionnere on chromosome 1 4q.^" It has also been shown that 
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GM-CSP maps within 9 kb of IL-3 in the same transcrip- 
tional orientation.^^ Using this information and assuming a 
stimple translocation event in our sample, we can conclude 
that the IL-3 gene is normally more centromeric, and the 
GM^SF gene, more telomeric on chromosome Sq (Fig 4). 
Furthermore, both are transcribed with their 5' ends toward 
the centromere. 

DISCUSSION 

In this report we have cloned a unique chromosomal 
translocation that appears to be a consistent feature of a rare, 
yet distinct* clinical form of acute leukemia. This transloca- 
tion joined the promoter of the IL-3 gene to the IgH gene. 
Except for the altered promotor, the IL-3 gene appeared 



5 ' G<n!GACC AGGGTTCCCTGKCCC AGTAGTCAAAGTAGTAGAGCTiATTC ATCATAGCTGCGGAOT o ^ 

3 • CCACTGGTCCCAAGGGACCGGqGTCATCAGTTTCATC ATCTCCATTAAGTAGTATCGACGCCTAAlK:GTCGCACT^ 

• • • • 

5 ' TACC AGACAAACTCTC ATCTGTTCCAGTGGCCTCCTGGCC ACCCACCAG6ACCAAGCAGGGCGGGCAGC AGAGGGCC A , g „ 
S'ATGGTCTGTXTGAGAGTAGACAAGGTCACCGGAGGACCGGTGGGTGGTCCTGGTTCGTCCCGCCCGTCGTCTC^ 

********* « • * 

5 ' GTAGTCCAGGTGATGGCAGATGAGATCCCACTGGGCAGGAGGCCTCAGTGAGCTGAGTCAGGCTTCCCCTTC 240 
3 ' CATCAGGTCCACTACCGTCTACTCTAGGGTGACCCGTCCTCCG6AGTCACTCGACTCAGTCCGAAGGG 

> 

• • • • 

5 ' GGGGTCCTCTCACCTGCTGCCATGCTTCCCATCTCTCATCCTCCTTGAC AAGATC 320 
3 • CCCCAGGAGAGTGGACGACGGTACGAAGGGTAGAGAGTAGGAGGAACTGXTCTACTWaiCTATGGCAAAT^ 

• • • • 

5 « TTTCTTGTTTC ACTGATCrPGAGTACTAGAAAGTCATGGATGAATAATTACGTCTGTGqTTT^ 

3 ' AAAGAACAAAGTGACTAGAACTCATGATCTTTC AGTACCTACTTATTAATGCAGACACCAAAAG 

■ 

5 • CAGATAAAGATCCTTCCGACGCCTGCCCCACACCACCACCTCCCCCCGCCTTGCCC^ ^gg 
3»GTCTATTTCTAGGAAGGCTGCGGACGGGGTGTGGTGGTGGAGGGGGGCGGAACGGGCCCCAACACCCGTGGAACGACGAC 

5 • CAC ATATA AGGCGGGAGGTTGTTGCCAACaXTTCAGAGCCCCACGAAGGACCAGAACAAGACAGAGTGCCTCCTGCCGAT ^^^^ 
3 ' GTGTATATTCCGCCCTCCAACAACGGTTGAGAAGTCXCGGGGTGCTXCCTGGTCTTGTTC 

5 ' CCAAACMGAGCCGCCTGCCCGTCCTGCTCCTGCTCCAACTCCTGGTCC&CC^^ ^^^^ 
3 • GGTTTGTACTCGGCGGACGGGC AGGACGAGGACGAGGTTGAGGACCAGGCGGGGCCTGAGGTTCGAGGGTACTGGGTC^ 



5 ' AACGTCCTTGAAGACAAGCTGGGTTAAC 3 ' 
3 • TTGCAGGAACTTCTGTTCGACCCAATTG 5 ' 
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5 • TGGCCCCAGTAGTCAAAGTAGTCACATTGTGGGAGGCCCCATTAAGGGGTGCACAAAAACCTGACTCTC 
3 • ACCGGGGTCATCAGTTTCATCAGTGTAACA CCCTCCGGGGTAATTCCCCACGTGTTTTTgGACTGAGAG 

++++++++++++++++++++++ 

5 • TGGCCCCAGTAGTCAAAGTAGTAGAGGTAATTCATCATAGCTGCGGATTAGCAGCGTGACCGGCTACCA 
3 • i^rrr=r^r!r-rprATriir:TTTrATr AVqTCCAa?TAAGTAGTATCGACGCCT AATCGTCGCACTGGCCGATGGT 

++++++++++++++++++-»•+++ 

5 • GGCACCAAGAGATGTGCTTCTCAGAGCCTGAGGCTGAACGTGGATGTTTAGCAGCGTGACCGGCTACCA 
3 • CCGTGGMCTCTACACGAAGAGTCTCGGACTCCGACTTGCACCTACAAATCGTCGCACTGGCCGATGGT 



Fig 3. Sequence of t|5:14)(q31;q32) breakpoint region. (A) Nucleotide eequence of the BziXXilHpaX fragment indicated on Fig 2. 
Nucleotides 1 to 36 represent the Jh4 coding region underlined on the coding strand.* Nucleotides 39 to 63 are a putative N region. The 
sequence from posHlon 64 to 668 Is that off the germline IL-3 gene." The IL.3 TATA box 1465), transcription start (515), and Initiation 
methionine (667) are underlined. Two proposed regulatory sequences In the promotor are marked by asterisks (positions 1 82 and 389). (B) 
Comparative sequence of the t(6;14)(q31;q32) breakpoint region. The l9Jh4 region is shown with its coding region, heptamer, and 
nonamer undeHlned. Clone no. 4 is shown with putatWe N region sequences underlined. The lL-3 sequence is also shown. A plus sign {+) 
denotes the Identical nucleotide between sequences. No heptamer or nonamer is identified in the lt-3 sequence. 
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Fig 4. Diagram of the translocation, Tho normal chromosome 
5q31 i$ shown with tho GM-CSF gene tatomerio to the IL-3 gene in 
the transcriptional orientation shown. On normal chromosome 
14q32 the Vh regions are telomeric. The t|6;14Hq31 ;q32) translo- 
cation results In the head-to-head oHentation of these genes. 
Symbols are defined in Fig 2. BP, breakpoint posttion. 



intact as no deletions, insertions, or point mutations were 
detected by restriction mapping of the entire gene and 
sequencing of part of the gene. The IgH gene has been 
truncated at the Jh4 region, which places the immunoglobu- 
lin enhancer within 2.5 kb of the IL-3 gene."*" This leads to 
the hypothesis that the enhancer is increasing transcription 
of a structurally normal IL-3 gene. The same mechanism is 
important for activation of the i>myc gene in some cases of 
Bwrkitt's lymphoma,** An alternate hypothesis is that the 
elimmation of an upstream lL-3 promoter element is crucial 
to the activaUon of the IL-S gene. 
The proposed activation of the IL-3 gene suggests that an 

important for the pathogenesis of this 
i^^ukemia. Over-expression of the IL-3 gene coupled with 
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the presence of the IL-3 receptor in these cells could account 
for a strong stimulus for proliferation. In this regard^ there 
are data indicaUng that immature B-lineage lymphocytes 
and B-lineage Icukemias may express the IL-3 receptor,*"'^ 

M additional feature of this type of leukemia is the 
dramatic eosinophilia, consisting of mature forms. It has 
been hypothesized that the eosinophils do not arise from the 
malignant done, but are stimulated by the tumor,^*** 
Because of the known effect of IL-3 on eosinophil differentia- 
tion, secretion of high levels of IL.3 by leukemic cells might 
have a role m the eosinophilia m this type of leukemia. 

The data suggest that the recombination mechanism that 
is active in the IgH gene during normal differentiation has a 
role in this translocation."'" This is supported by the break- 
point location at the 5' end of Jh4 and tiie presence of 
putative N-rcgion sequences. On the other hand, no recombi- 
iiation signal sequence (heptamer and nonamer) was found 
in this region on chromosome 5/suggesting tiiat additional 
factors also played a role. Further studies will elucidate the 
mechanism of this and other translocations.. 

In the leukemia we studied, it is possible tiiat tiie immuno- 
globulin enhancer also activates the GM-CSF gene, since 
this gene is probably positioned only 14 kb away (Fig 4). This 
is known to be within the range of enhancer activation.^ The 
interleukin-S (lL-5) gene maps to chromosome 5q31.2« 
Deregulation of the IL-5 gene by this translocation would act 
synergistically with lL-3 in the stimulation of eosinophil 
proliferation and differentiation." These and other questions 
will be answered by the study of more patient samples; We 
plan to determine whether the t(5;14)(q31;q32) transloca^ 
tion is capable of activating multiple lymphokines sunulta^ 
neously and whether they cooperate in the generation of this 
leukemia. 
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RAPID COMMUNICATION 

Activation of the Iiiterieukin>3 Gene by Chromosome Translocation in Acute 

Lymphocytic Leukemia With Eosinophilia 

• 

By Timothy C. Meeker, Dan Hardy, Cheryl NATillman, Thomas Hogan, and John Abrams 

acthflty. There was no evidence of excess granulocyte/ 
macrophage colony stimulating factor (GM-^CSF) or IU5 
expression. Our data support the formulation that this 
subtype of leulcemla may arise in part because of a 
chromosome translocation that activates the IW gene, 
resulting in autocrine and paracrine growth effects, 
© 19B0 by The American Society of Hematology, 
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16.6 mmol/L ammonium sulfate^ 1 .5 mmol/L each dNTP and Taq 
polymerase (Ferldn-Elmer» Norwalk, CT)." 

Sequencing. Sequendng was done by chain termination in M 1 3 
vectors." As part of this study, we sequenced a subclone of a normal 
IL-3 promoter, covering 598 base pairs from a Sma I site at position 
- 1240 (with respect to the proposed sit&of transcriptton initiation) 
to aniV^ I site at position -642. The plasmid containing this region 
was a gift from Naoko Arai of the DNAX Research loslitutc. 

Expression in Cos? cells, A genomic IL-3 fragment from Case 1 
was cloned into the pXM expression vector,'* Briefly, the HindUl/ 
Sal I fragment containing the IL-3 gene was subcloned from the 
previously .described phage clone 4 into pUClS.^ The 2.6 kb 
fragment extending from the Sma I site 61 bp upstream of the IL-3 
transcription start to the Sma I site in the polylinker was cloned into 
the blunted Xho I site of pXM. The negative oonUol constroct was 
the pXM vector without insert Plasmids were introduced faito Cos? 
cells by electroporation, and supernatant was collected after 48 
hours in culture. 

TFl bioassay. TF-1 cells were passaged in RPMI 1640 supple- 
mented with 10% heat-inactivated fetal bovine serum, 2 mmol 
L-glutamine,and 1 ng/mLhumanOM-CSF.»* Samples and antibod- 
ies were diluted in this same medium lacking GM-CSF but contain- 
ing penicillin and streptomycin. A 25 /iL volume of serial dilutions of 
patient serum was added to wells in a flat bottom 96-W611 microtiter 
plate. Rat anti-cytokine monoclonal antibody in a volume of 25 mL 
was added to appropriate wells and preincubated for 1 hour at 37*>C. 
Fifty microliters of twice washed TF-1 cells wwe added to each well, 
giving a final cell concentration of 1 x. 10* cells per well. (final 
.volume. 100 ^L). The plate was incubated for 48 hours. Hie 
remaining cell viability was determined metaboUcally by the colori- 



The t|6;14)(q31;q32) translocation from B-iineage acute 
lymphocytic leukemia with eosinophilia has been cloned 
from two leukemia samples. In both cases, this transloca- 
tion joined the IgH gene and the lnterleuldn-3 (IL-S) gene. In 
one patient, excess IL-3 mRNA was produced by the 
leukemic cells. In the second patient, serum JL-3 levels 
were measured and shown to correlate whh disease 

A NUMBER OF chromosome translocations have been 
associated with human leukemia and lymphoma. In 
many cases the study of these translocations has led to the 
discovery or characterization of proto-oncQgenes, such as 
bcl-2, crabl, and o-myc^ that are .located adjacent to the 
translocation,*** It is aow widely understood that cancer- 
associated translocations disrupt nearby proto-oncogenes. 

A distipct subtype of acute leukemia is <^aracterized by 
the triad of B-lineage immunophenotypc, eosinophilia, and 
the t(5;14)(q31;q32) translocaUon.^*^ Leukemic ceils from 
such patients have been positive for terminal deoxynucleotidyl 
transferase (Tdt), common acute lymphoblastic leukemia 
antigen (CALLA), and CD19, but negative for surface or 
qrtoplasmic inununoglobulin. In previous work, we cloned 
the t(5;14) breakpoint from one leukemic sample (Case 1) 
and determined that the IgH and interleukin-3 (IL-3) genes 
were joined by this abnormality.^ In this report, we extend 
those findings by showing that the t(5;14)(q31;q32) translo- 
cation from a second leukemia sample (Case 2) has a similar 
structure, and wc report our study of growth factor expres- 
sion in these patients. 

MATERIALS AND METHODS 

Samples and Southern blots. Case 1 has been described.*^ 
Clinical features of Case 2 have been described in detail.' DNA 
isolation and Southern blotting was done using previously described 
methods.* FUters were hybridized with an immunoglobulro Jh probe, 
a 280 bp Bamm/EcoKl genomic IL-3 fragment, and an IL.3 
cDNA probe."'-* 

Northern blots, RNA isolation and Northern blotting have been 
described.* Briefly, Northern blots were done by separating 9Mg 
total RNA on \% agarose-formaldchyde gels. Equal RNA loading hi 
each lane was confirmed by ethidium bromide staimng. Blots were 
hybridized with an IL-3 cDNA probe extending to thtXho 1 site in 
exon 5, a 720 bp Sst l/Kpn I probe derived from intron 2 of the IL-3 
gene, a 600 bp Nhe l/Hpa I lL-5 cDNA probe, and a 500 bp Pst 
l/Nco I granulocyte-macrophage colony stimulating factor (GM- 
CSF) cDNA probe."»-»» 

Polymerase chain reaction. Frimers were designed with BamHl 
sites for cloning. One primer hybridized to the Jh sequences from the 
IgH gene (Primer 144:5'-TAGGATCCGACGGTGACCAGGGT). 
and the other hybridized to the region of the TATA box in the IL-3 
gene (Primer 161: y-AACAGOATCCCGCCTTATATGTGCAG). 
Polymerase chain reaction (FCR) (95*»C for 1 minute, 6PC for 30 
seconds, and 72'»C for 3 minutes) was done usbg 500 ng genomic 
DNA and 50 pmol of each primer in 100 fiL containing 67 mmol/L 
Tris-HCl pH 8.8, 6.7 mmol/L Mgaj, 10% dunethyl sulfoxide 
(DMSO), 170 Mg/mL bovine serum albumin (BSA) (fraction V), 
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1. Broakpolnt sequences for Case 2. The germllne rgJh5 region sequence (protein codino mtikwi u 
sequences are underfined) b on top, the tnmslocallon sequence from Case 2 (PCR pHmer seQuences hndnS^ifl^ ^mWnation signaj 
Is in the mWdre. and the germllne sequence, which we derh^ed frtnn a normal J^^^^^^^^T^^ "ndertiaod) 

sequence has the same nucleotide. The sequence documents the head-t<Mieadiolfilng of the IL-3 and lnH aen^hAr^Li^' 

gene occurred at position -934 {•), ^ genes. The breakpoint in tfte IL-^ 
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metric method of Mosmann using a VMax microtiter plate reader 
(Molecular Devices, Menlo Park, CA) set at 570 and 650 nm." 

Q^tokine immunoassays. These assays used rat monoclonal 
anti-cytoldne antibodies (10 Mg/mL) to coat the welb of a PVC 
miciotiter plate. The capture, antibodies usad were BVD3-6G8» 
JES1-39D10. and BVD2-23B6, for the IL-3. 11^5, and GM-CSF 
assays, rcspecUvcly. Patient sera were then added (undiluted and 
diluted 1:2 for IL*3, undiluted for 11^5, and undiluted and dUuted 
1'^ for GM-CSF). The detecting inununorcagents used were either 
mouse antiserum to IL-3 or niUoiodophenyl (NIP)-derivatizcd rat 
monoclonal antibodies JES1-5A2 and BVD2-2JC11, specific for 
IL-5 and GM-CSF, respectively. Bound antibody was subscquentiy 
detected with immunopcroxidase conjugates: horeeradish peroxidase 
(HRP)-labeled goat anti-mouse Ig for IL-3, or HRP-Iabeled rat (J4 
MoAb) anti-NIP for lL-5 and GM-CSR The chromogenic sob- 
strate was 3-3'azino-bis-ben2thiazoline sulfonate (ABTS; Sigma. St 
Louis, MO). Unknown values were interpolated from standard 
curves prepared from dilutions of the recombinant factore using 
Softmax software available with the VMAX mxcroplate reader 
(Molecular Devices). 

RESULTS 

Leukenuc DNA from Case 2 was studied by Southern 
blotUng. When digested with the /findlll restriction enzyme 
and hybridized with a human immunoglobulin heavy chain 
joinuig region (Jh) probe, a rearranged fragment at approxi- 
. mately 14 kb was detected (data not shown). When reprobed 
with either of two different IL-3 probes, a rearranged 14 kb 
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fragment, ^migrating with the rearranged Jh fragment, was 
idenUfied^ Whea leukemic DNA was digested with iyi„dIII 
puis £coRI, a rearranged Jh fragment was detected a t 6 kb 
The IL.3 probes also identified a comigrating fragment of 
this size. These experiments Indicated that the leukemic 
wmple studied was clonal and that a single fraement 
contamed both Jh and IL.3 sequences, suggesting a translo- 
caUon had occurred. 

To charactnize better the joming of the 11^3 gene and the 
mmunogtobulm heavy cham (IgH) gene, the polymerase 
chaw reacbon (PC^ was used to clone the translocation." 

primer were designed to produce an 
amplified product in the event of a head-to-head transloca- 
tton. While control DNA gaveno PCR product. Case 2 DNA 
yielded a PCR-derived fragment of approximately 980 bp 
which was cloned and sequenced. 

Tlie DNA sequence of the translocation done from Case 2 
confirmed the joining of the Jh region with the promoter of 
the 1L.3 gene in a head-to^head configuration (Fie n 
Sequence analysis mdicated that the breakpoint on chromi 
some 14 was just upstream of the .JhS coding r(»ion The 
breakpoint on chromosome 5 occurred 934 bp upstream of 
the putetire site of transcription initiation of the IL-3 gene 
We also determined that a putative N sequence of 17 bp was 
inserted between arch^Smosome 5 and chromosonTe 14 
sequences during the translocation event."-" Figure 2 shows 
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Fl9*. RolatlonsWp of cliromos«meBbreakpolnt» to the IL-3 gen*. Thb figure shows the two o«onodbrBaVBoin»o/o >. 

tho normal 11^ g-no."' Ono br«ikp1nt ocmTrod at posMon -4«2 and th« whor at -^M (arrowt^fn ^^^^^ " '"^'^ *° 

transloeatlons resultod bi a hea<M<»4t«ad Joining of the IgH gene and the IL-3 geno. leaving the mRNA end oroteb^ I!"."'"""'**' *• 
gene Intact. Boxes denote the five VL-9 axons; restriction oniyroes are (B) 9smHL (P) Pst I. (HJ Hpa I. (EleLw and W Afco I '^^^'^'^ 
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FI9 3. Docutnantation of IL-3 mRNA ovar-«xpresslon. A NmtiMm Mot was prepared and hybridized whh a nralu. for 11 •» 1 < 
«r«..n«dRNAfro«Mns«mutotedperlph,ralbl^ 

m ,1 "f 3 «mtaln«l RNA fr<Hn PBL atlmubtwi whh ConA f or 48 IwurTiLTrrtile port^^^ 

»^J^?H J'^"*!!' 1" r '^f^ tha mature 1 kb transcript by rtowino thattSs band h;bridbed to7p«*a 

2 Of the B.-3 sane A Btmilar 24» kb band was detected in lane 2. auoaertln, that an IL-3 mRNA of thliaW^h w^IJC 

^^^^ "*^*'»:«>'""'«ted cell*. Una 8 throuflMO represent RNA from aU samples of IMimMs* bJu^^^S^^^^^S^ 

^aH^ *J ' u' *":»'»«'«°"' '"««»«"8 ^h"* on«y tha wmple whh the tranalooation axMMted lb« ovar JU««3Sa^«kuSj£ 

snalyzed by Northern blot because too few calls waroavailabia for atudy. »i>ro»sion.«^8e»couwnotBe 
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*he locations of the two cloned breakpoints in relation to the 
JL-3 gene. The two chromosome 5 breakpoints were sepa- 
arated by less than 500 bp. 

The genomic structure in Cases 1 and 2 suggested that a 
jiormal IL-3 gene product was over-expressed as a result of 
the altered promoter structure. This would predict that the 
IL-3 gene on the translocated chromosome was capable of 
jualdng IL-3 protein. This prediction was tested by express- 
ing a genomic fragment from the translocated allele of Case 
I containing all live IL-3 exons under the control of the S V40 
proraotor/enhancer in the Cos? cell line. Cell supematants 
vferc studied in a proliferation assay using the factor depen- 
dent erythroleukemic cell line, TF-1. The supematants* 
derived from transfections using the vector plus insert 
supported TF-l proliferation, while supematants firom trans- 
fections using the vector alone were negative in this assay 
(data not shown). Furthermore, the biologic activity could be 
blocked by an antibody to human IL-3 (BVD3-6G8). This 
result showed that the translocated allele retained the ability 
to make IL-3 mRNA and protein. 

The level of expression of lL-3 mRNA in leukemic cells 
from Case 1 was assessed. Northern blotting showed that the 
mature IL-3 mRNA (approximately 1 kb) and a 2.9 kb 
unspliced 11^3 mRNA were excessively produced by the 
leukemia (Fig 3), The 2.9 kb form of the mRNA is also 
present at low levels in normal peripheral blood T lympho- 
cytes after mitogen activation (Fig 3). Several B-lineage 
acute leukemia samples without the t(5;14) translocation 
had undetectable levels of IL-3 mRNA in these experiments. 
In addition, although genes for GM-CSF and IL-5 map close 
to the 11^3 gene and might have been deregulated by the 
translocation, no IL-5 or GM-CSF mRNA could be detected 
in the leukemic sample (data not shown). 

Three serum samples from Case 2 were assayed by 
inmiunoassay for levels of IL-3, GM-CSF, and IL-5 (Table 
1). Semm IL-3 could be detected and correlated with the 
clinical course. When the patient's leukemic cell burden was 



highest, the IL-3 level was highest. No semm GM-CSF or 
IL-5 could be detected. 

Since the IL-3 immunoassay measured only immunoreac- 
tive factor, we confimed that biologically active IL-3 was 
present by using the TF-1 bioassay. This bioassay can be 
rendered monospecific using appropriate neutralizing mono- 
clonal antibodies specific for IL-3, IL-5, or GM-CSF. We 
observed that sera from 1-16-84 and 3-14-84 contained TF-1 
stimulating activity that could be blocked with anti-IL-3 
MoAb (BVD3-6G8), but not with MoAbs to IL-S (JESl- 
39D10) or GM-CSF (BVD2.23B6) (Fig 4; GM-CSF data 
not shown). The amount of neutralizable bioactivity in these 
two samples correlated very well with the difference in IL-3 
levels obtained by immunoassay for these samples. Further- 
more, the failure to block TF-1 proliferating activity with 
either ant|-IL-5 or anti-GM;CSF^was consistent with the 
inability to measure these factors by immunoassay .and 



Tabl« 1. Peripheral Blood Counts and Growth Factor Uvela 

at Different Times bi Case 2 

Samplo Data 





1 1/16/83 


1/16/84 


3/14/84 


Peripheral blood counts (cells/;xL) 








WBC 


81,800 


116,600 


12,300 


Lymphobtasts 


0 


33J86 


0 


Eosinophils 


46,626 


73,080 


616 


Serum growth factor levels (pg/mL) 






« 


tL-3 


<444 


7,995 


1,061 


6M-CSF 


<16 


<16 


<I6 


IL-5 


<50 


<50 


<60 



Peripheral blood counts from Case 2 at Uureo different time points with 
the correspondino growth factor levels quantified fay immunoassay. The 
patient received chemotherapy between 1/16/84 and 3/14/84 to lower 
his leukemic burden.^ No serum samples were available for a similar 
analysis of Case 1. 

Abbre^atlon: WBC, white btood cells. 
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indicated that these other myetoid growth factors were not 
dctectably circulaUng in the serum of this patient 

DiSCUSsrON 

In this report, we have extended our analysis of acute 
lymphocytic leukemia and eosinophilia associated with the 
t(5;14) translocation. In both cases we have studied, we have 
documented the joining of the IL-3 gene from chromosome 5 
to the IgH gene from chromosome 14. The breakpoints on 
chromosome 5 are within 500 bp of each other, suggesting 
that additional breakpoints will be clustered in a small region 
of the IL-3 promoter. The PGR assay we have developed wiU 
be useful in the screening of additional clinical samples for 
this abnormality. 

The finding of a disrupted IL-3 promoter associated with 
an otherwise normal IL-3 gene implied that this transloca-. 
tion might lead to the over-expression of a normal IL.3 gene 
product. In this work, we have documented that this is true 
In addiuon, neither GM-CSF nor !l-5 arg over-expressed by 
the leukemic cells. Furthermore, in one patient, serum IL-3 
could be measured and correlated with disease activity. To 
our knowledge, this is the first measurement of human IL-3 
in serum and its association with a disease process. The 
measurement of serum IL-3 in this and other clinical setUngs 
may now be indicated. 



The findmg of the IL.3 gene adjacent to a cancer- 
associated translocation breakpoint suggests that its activa- 
tion is important for oncogenesis. It is our thesis that an 
autocrine loop for 11^3 is important for the evolution of tiiis 
leukemia." The excessive IL-3 production tiiat we have 
documented would be one feature of such an autocrine loop 
The final proof of our Ojesis must await additional data In 
particular, from the study of additional clinical samples' it 
will be necessary to document tiiat the IL-3 receptor is 
present on' the leukemic cells and that antir-lL-3 antibody 
decreases proUferation of the leukemia in vitro. 

An important aspect of tiiis work is tht suggestion of a 
therapeutic approach for this disease. If an autocrine loop for 
IL-3 can be documented in tiiis disease, attempts to lower 
circulatmg IL-3 levels or block tiie interaction of IL-3 with 
Its receptor may prove useful. Because it is also possible that 
tilt? eosinophilia in these patients is mediated by tiie para- 
crme effects of leukemia-deriv.^ nt^3, similar interventions 
may improve tiiis aspect of Uie disea^. Antibodies or " 
engineered ligands to accomplish tiiese goals may soon be 
available. 
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Clinical and Pathologic Significance of the 
c-erbB-2 {HER'2/neu) Oncogene 

Timothy P. Singleton and John G. Strickler 



The c-^2»B-2 oncogene was first shovm to have clinical significance in 1987 by 
Slainon et al,*™ who reported that c^erbB-Z DNA amplification in breast.carcino-, 
mas correlated with decreased survival in patients with metastasis to ariUary 
lymph nodes. Subsequent studies, however, of c-er6B-2 activation in breast 
carcinoma reached conflicting conclusions about its clinical significance. This 
oncogene also has been reported to have clinical and patholo^c implications in 
other neoplasms. Our review summarizes these various studies and examines 
the clinical relevance of c-er6B-2 activation, which has not been emphasized in 
recent TQviows.^^^^ The molecular biology of the c-er6B-2 oncogene has been 
extensively revlewed*^.^.® and will be discussed only briefly here. 

I 

BACKGROUND 

The o-erirfi.2 oncogene was discovered in the 1980s by three lines of investlga- 
tion. The neu imcogene was detected as a mutated transforming gene in 
neuroblastomas induced by ethylnitrosurea treatment of fetal rats.**^^^^^ The c- 
erbB-2 was a human gene discovered by its homology to the retroviral gene v- 
erfcB.».49.« H£A-2 was isolated by screening a human genomic DNA library for 
homology with v-erbB.^ When the DNA sequences were determined subse- 
quently, c-erfcB-2, and new were found to represent the same gene. 

Recently, the C'erfcB-2 oncogene also has been referred to as NGL, 

The c.erfcB-2 DNA is located on human chromosome 17q21"'»« and codes 
for c.eriB-2 mRNA (4.6 kb), which translates c-er6B-2 protein (pl85). This 
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protein is a norma] component of cytoplasmic membranes* The c-erbB^i 
oncogene is homologous with, but not identical to, c-erbB-1, which is located 
on chromosome 7 and codes for the epidermal growth factor rcccptor.^^^Thc c- 
erbB*2 protein is a receptor on cell membranes and has intracellular tyrosine 
kinase activity and an extracellular binding domain>^^ Electron microscopy 
with a polyclonal antibody detects o-er&B-2 immunoreactivity on cytoplasmic 
membranes of neoplasms, especially on microvilli and the non-villous outer cell 
membrane.^ In normal cells, immunohistochemical reactivity for oerbB-i is 
frequently present at the basolateral membrane or the cytoplasmic membrane s 
brush border 

There is experimental evidence that &erbB-2 proteb may be involved in 
the pathogenesis of breast neoplasia. Overproduction of otherwise normal c- 
erbB-2 protein can transform a cell line into a malignant phenotype.^ Also, 
when the neu oncogene contaifTin^liii activating point mutation is placed in 
transgenic mice with a strong promoter for increased expression, the mice 
develop multiple independent mammary adenocaixsinomas.^^'^ In other experi- 
ments, monoclonal antibodies against tiie neu protein inhibit the growth (in 
nude mice) of a n&u-transformed cell Une,^^ and.immunization of mice with 
neu protein protects them from $ub5equent tumor challenge with the neu- 
transformed cell line.^^ Some authors have speculated that the use of antago- 
nists for the unknown ligand could bo usoiul in future chemotherapy.^ Further 
review of this experimental evidence Is beyond the scoi>e of this article. 

The c-erbB-2 activation most Ukely occurs at an early stage of neoplastic 
development* This hypothesis is supported by the presence of c^6B-2 activa- 
tion in both in situ and invasive breast carcinomas. In addition, studies of 
metastatic breast carcinomas usually demonstrate uniform <herhB-2 activation 
at multiple sites in the same patient, "''^'^'"'^^ although c-€rfeB-2 activation has 
rarely been detected in metastatic lesions but not in the primary tumor;"'®**" 
Even more rarely, c«er&B-2 DNA amplification has been detected in a primary 
breast carcinoma but not in its lymph node metastasis.^ In patients who have 
bilateral breast neoplasms, both lesions have similar patterns o{Q-erbl&^2 activa- 
tion, but only a few such pases have been studicd.^^ 




MECHANISMS OF ACTIVATION 
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The most common mechanism of c-erbB-2 activation is genomic DNA ampliRca- 
tion» which almost always results in overproduction of c^ar^B-2 mRNA and 
protein, iT,a4.65,Bi c-erbB-:2 amplification may stabilize the overproduction of 
mRNA or protein througl) unknown mechanisms. Human breast carcinomas 
with c-er&B<-2 amplification contain 2 to 40 times more c-^r&B''2 DNA^^ and 4 to 
128 times more c-erbB'2 mRNA^»^ than found in normal tissue. Most human 
breast carcinomas with c~er&B-2 amplification have 2 to 15 times more c-er&B-2 
DNA. T\imors with greater amplification tend to have greater overproduc- 
tion. ^"^'^'^ The non-mammary neoplasms that have been studied tend to have 
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similar levels of c-erbB'Z amplification or overproduction relative to the corre- 
sponding normal tissue. 

The second most common mechanism of c^&B-2 activation is overproduce 
tion of c*6r&B*2 mRNA and protein without amplification of o^rbB-i DNA.^^ 
The quantities of mtRNA and protein usually are less than those in amplified 
cases and may approach the small quantities present in nonnal hreast or other 
tissues. i^'K^a cerbB-Z protein overproduction vdthout mRNA overproduc- 
tion or DNA amplificatioQ has been described in a few human breast carcinoma 
ceU lines.^ 

Other rare mechanisms of oerbB^Z activation have been reported. Translo- 
cations involving die o-er&B-2 gene have been described in a £sw mammary and 
j gastric carcinomas, although some reported cases may represent restriction 

fragment length polymorphisms or incomplete restrtctton Enzyme digestions 
that mimic translocations.^^*^'*^^^'^® A slrigle point mutation in the transmem- 
brane portion of neu has been described in rat neuroblastomas induced by 
ethylnitrosureai'*^ The mutated neu protein has increased tyrosine kinase activ- 
ity and aggregates at the cell membrane. "^^^ Although there has been specula- 
tion that some of the amplified c-erfeB-2 genes may contain point mutations,^ 
none has been detected in primary human neoplasms*^^'^'^^ 

TECHNIQUES FOR OETECTINQ C-9rbB-2 ACTIVATION 

■ 

Detection of c«erbB*2 DNA AmplH icatton 

Amplification of o-gr6B-2 DNA is usually detected by DNA dot blot or South- 
em blot hybridization. In the dot blot method, the extracted DNA is placed 
directly on a nylon membrane and hybtidized with a c-erbB-Z DNA probe. In 
the Southern blot method, the eactracted DNA is treated with a restriction 
enzyme » and the fragments are separated by electrophoresis, transferred to a 
nylon membrane, and hybridized with a c^erbB-Z DNA probe. In both tech- 
t niques, o-erbB-2 amplification is quantified by compiaxing the intensity (thea- 

^ sured by densitometry) of the hybridization bands from the sample with those 
from control tissue. 

■ 

Several technical problems may complicate the measurement of c-er^B-2 
DNA amplification. First, the extracted tumor DNA may be excessively de- 
' . graded or diluted by DNA from stromal cellsi^^ Second, the c-erbB-2 DNA 

probe must be carefully chosen and labeled. For example, oligonucleotide 0- 
er6B-2 probes may not be sensitive enough for measuring a low level of c-er&B- 
i 2 amplification, because diploid copy numbers can be diificult to detect (unpub- 

; lished data). Third, the total amoimts of DNA in the sample and control tissue 

• must be compensated for, often with a probe to an unamplified gene. Many 

studies have used control probes to genes on chromosome 17, the location of c- 
<?r6B-2, to correct for possible alterations in chromosome number Identical 
. results, however, are obtained by using control probes to genes on other chro* 

1 mosomes,^*^'^ with rare exception.'^ Studies using control probes to the beta- 
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globin gene must be xnteipreted with caution, because one aHele of this gene is 
deleted occasionally in breast carcinomas.^ 

Amplification of DNA was assessed by using the polymerase 

chain reaction (PGR) in one recent study,^ Oligoprimers for the o-erbh^2 gene 
and a control gene are added to the sample s DNA» and PGR is performed. If 
the sample contains more copies of c-erbB-Z DNA than of the control gene, the 
c-erbi-Z DNA is replicated preferentially. 

Detection of c-eri>B"2 mRNA Overproduction 

Overproduction of c-er&B-2 mRNA nsually is measured by RNA dot blot or 
Northern blot hybridization, ^th techniques require extraction of HNA but 
otherwise are analogous to DNA dot blot and Southern blot hybridization. Use 
of PGR for detection of o^r5B-2 mRNA hacbeen described in two re<:ent 
abstracts.^^^ 

Overproduction of c-cr&B-2 mRNA can be measured by in situ hybridiza- 
tion, Sections are mounted on glass slides, treated with protease, hybridized 
with a radiolabeled probe, washed, treated with nudease to remove unbound 
probe, and developed for autoradiography. Silver grains are seen only over 
tumor cells that overproduce c-er&B-2 mRNA. Negative control probes are 
used.^'W«^w Our eaLperienoe indicates that these tedmiques arc relatively insensi- 
tive for detecting oerbB-2 mRNA overproduction in routinely processed tis^ 
sue. Although the sensitivity may be increased by modifications that aliow^ 
simultaneous detection of c-0r&B-2 DNA and mRNA, in situ hybridization still 
is cumbersome and expensive (unpublished data). 

All of the above o-erfrB-2 mRNA detection techniques have several prob- 
lems that make them more difficult to perform than techniques for detecting 
DNA amplification* One n:iaJor probiem is the rapid degradation of RNA in 
tissue that is not immediately frozen or fixed. In addition, during tibe detection 
procedure, RNA can be degraded by RNase; a ubiquitous razyme, which must 
be eliminated meticulously from laboratory solutions. Third, control probes to 
genes that are uniformly expressed in the tissue of interest need to be carefully 
selected. ^ - * . 

Detection of o^rbB'2 Protein Overproduction 

The most accxirate methods for detecting c-erhB^l protein ovetproduction are 
the Western blot method and immunoprecipitation. Both techniques can docu- 
ment the binding specificity of varibiis antibodies against c-er/?B-2 protein. In 
Western blot studies, protein is extracted firom the tissue, separated by electro- 
phoresis (according to size), transferred to a membrane, and detected by using an- 
tibodies to c-6r&B<-2. In immunoprecipitation studies, antibodies against c-er£rB- 
2 are added to a tumor lysate, and the resulting protein-antibody precipitate is 
separated by gel electrophoresis and stained for protein. Both Western blot and 
immunoprecipitation are useful research tools but currently are not practical for 
diagnostic pathology. IWo recent abstracts have described an enzyme-linked 
immunosorbent assay (BUS A) for detection of c-0rtB*2 protein. 
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Overproduction of o^r&B-2 protein is most commonly assessed by various 
immunohistochemical tedmiques. These procedures often generate conflicting 
results, which are explained at least partially by three factors. First, various 
studies have used- different polyclonal and monocbnal antibodies. Because 
some polyclonal antibodies recognize weak , bands in addition to the c-erbB*2 
protein band on Western blot or immunoprecipitation, the results of these 
studies should be interpreted vrtth caution.^*^^'^ £ven some monoclonal ahti* 
bodies immunoprecipitate protein bands in addition to c-erbB-2 (plSS),™-^'* 
Second, tissue fixation contributes to variability between studies. Fbr example, 
• some antibodies detect c-erbB'2 protein only in frozen tissue and do not' react 
in iixcd Ussuc, In general, formalin fixation diminishes the sensitivity of 
immunohistochemical methods and decreases the number of reactive cells."'^ 
When Bouin*& fixative is used» there may (be a luj^c^percejitage of positive 
cases,^ Third, minimal criteria for interpreting immunohistochemical staining . 
are generally lacking. Although there is general agreement that distinct crisp 
cytoplasmic membrane staining is diagnostic for o-er&B«2 activation in breast 
. carcinoma, the number of positive cells and the staining intensity required to 
diagnose Q^rbB-2 protein overproduction varies from study to study and from 
antibody to antibody. Degradation otQ-erbh'2 protein is not a problem because 
it can be detected in intact form more than 24 hours after tumor resection . 
without fixation or freezing.^ 

p * 

m 

ACTIVATION OF c-eri>B-2 IN BREAST LESIONS 
Incidence of c-erbB-2 Activation 

Most studies of c-er&B-2 oncogene activation do not specify histological sub- 
types of infiltrating breast carcinoma. Amplification of c-er&B*2 DNA was fotmd 
in 19.1 percent (519 of 2715) of invasive carcinomas in 25 studies (Ikble 1), and 
iyerbB'2 mRNA or protein overproduction was detected in 20.9 percent (568 of 
2714) of invasive carcinomas in 20 studies. Twelve studies have documented c^ 
irbh^2 mRNA or protein overproduction iti 15 percent (88 of 604) of carcinomas 
that lacked c-erbB-2 DNA amplification. 

The incidence of c-erbEf2 activation in infiltrating breast carcinoma varies 
with the histological subtype. Approximately 22 percent (142 of 650) of infiltrat- 
ing ductal carcinomas have c*eir6B-2 activation, as expected from the above 
data. Other variants of breast carcinoma with frequent o-erbB-2 activation are 
inflammatory carcinoma (62 percent, 54 of 87), Paget s disease (82 percent, 9 of 
11), and medullary carcinoma (22 percent* 5 of 23). In contrast, c-er&B-2 activa^ 
tion is infrequent in infiltrating lobular carcinoma (7 percent, 5 of 73) and 
tubular carcinoma (7 percent, 1 of 15), 

The c-erbB*2 protein overproduction is present in 44 percent (44 of 100) of 
ductal carcinomas in situ and especially comedocardnoma in situ (68 percent, 
49 of 72). The micropapillaxy type of ductal carcinoma in situ also tends to have 
c-er66*2 activation, especially if larger cells are present. The . greater fre- 
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■ 

quency of c-erbB'2 protein overproduction in comedocorcinoma in situ, com- 
pared with infiltrating ductal carcinoma, could be explained by the fact that 
many infiltrating ductal carcinomas arise from other types of intraductal carci- 
noma, which show o-erfeB-2 activation infrequently. Others have speculated 
that carcinoma in sitii with GHer&B-2 activation tends to regress or to lose c- 
erbB'Z activation during progression to Invasion.-*^'^®* Infiltrating and in situ 
components of ductal carcinoma, however, usually are similar with respect to o- 
erbB-2 activation," althoui^ some authors have noted more heterogeneity of 
the immunohistochemical staining pattern in invasive than in in situ carci- 
noma*** Activation of.c*erfcB-2 is infrequent in lobular carcinoma in situ. If 
lesions contain more than one histological pattern of caxcjnoma in situ, the c- 
ertB-2 protein overproduction tends, to occur in the comedocarcinoma in situ 
but may include other areas of carcinoma in situ.<«^«* Overproduction of c- 
•er&B-2 protein in ductal carcinoma in situ correlates with larger cell size and a 
periductal lymphoid infiltrate.^' 

Activation of c-er&B-'2 has not been identified in benign breast lesions, 
including fibrocystic disease, fibroadenomas, and radial scars (Table 2). Strong 
membrane immunohistochemical reactivity for o-erfcB-2 has not been described 
in atypical ductal hyperplasia, although weak accentuation of membrane staining 
has been noted infrequently, m*^** In normal breast tissue, o-erfeB'2 DNA is 
diploid, and c-erfeB-2 is expressed at lower levels than in activated tumors,^35,65,BB 

These preliminary data suggest that o-erbB-2 activation may not be useful 
for resolving many of the common problems in diagnostic surgical pathology. For 
example, c-er&B*2 activation is infrequent in tubular can^lnoma and radial scars. 
In addition, because c-er&B-2 activation is unusual in atypical ductal hyperplasia, 
cribriform carcinoma in situ, and papillary carcinoma in situ, detection of c>erbB- 
2 activation in these lesions may not be helpful in then- diiferenHal diagnosis. The 
histological features of comedocarcinoma in situ, which commonly overproduces 
o-grfeB-2, are unlikely to be mistaken for those of benign lesions. Activation of 



TABLE 2. c^fbB>2 ACTIVATION IN BENtON HUMAN BREAST LESIONS 



Histological Diagnosis 


>erbB^Z DHA 


c-erb&*2 mRNA 


o-erbB^ Protein 


AmpHficBtlon* 


Overproduction 

• 


Overproduction 


Fibrocystic disease 


o/to« 






Atypical ductal hyperplaala 






2(w©ak)/21« 






> 


1(cytoplasmlo)/l3» 


Benign ductal hyperplasia 






0/1 


Sclerosing adenosis 








Fibroadenomas 


0/16,«0/6,w 


0/6,» 0^ 


0/21 ,« 0/1 0» 


* 






Q/8,w 0/3« 


Radial scars 








Blunt duct adenosis 






0/1 4» 


"breast mastoals" 




0/33* 





"Shown as number cf cases wim acUvallon/irmmbar ot cases stud1ai5; rsleronce Is given as a superacrIpL 
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c-erfcB-2, however, does fevor infiltrating ductal carcinoma over infiltrating 
lobular carcinoma, Further studies of these issues would be useful. 

■ 

Correlation of c-erlbB-2 Activation With Pattiologlc Prognostic Faetcrs 

Multiple studies have attempted to correlate c-erbB^2 activation with various 
pathologic prognostic fectors (Table 3). Activation of o-erfeBr2 was correlated 
with lymph node metastasis in 8 of 28 series, with higher histological grade in 6 
of 17 series, and with higher stage in 4 of 14 series. Large tumor size was not 
associated with c-er2»B-2 activation in most studies (11 of 14). Tetraploid DNA 
content and low pTX>liferation, measured by Ki-67, have been suggested as 
prognostic &ctors and may correlate with o^r&B-2 activation.®*^ 

Correlation of c-erbB>2 Activation With CHnlcai-PrognostIo Factors 

Various studies have attempted also to correlate C"6rfeB-2 activation with clinical 
features that may predict a poor outcome (Table 4). Activation of c«er&B-2 
correlated with absence of estrogen receptors in 10 of 28 series and with ab- 
sence of progesterone receptors in 6 of 18 series. In most studies, patient age 
did not correlate wid) c^erbB-i activation, and, in the rest of the reports, c> 
«r&B*2 activation was associated with either younger or older ages. 

Correlation of c^rbB-2 Activation With Patient Outcome 

Slamon et al^*^^ first showed that amplification of the c-erbB-2 oncogene inde* 
pendently predicts decreased survival of patients with breast carcinoma. The 
correlation of c-^rbB-2 amplification with poor outcome was nearly as strong as 
the correlation of number of involved lyihph nodes with poor outcome. Slamon 
et al also reported that e-er^B-2 amplification is an important prognostic indica* 
tor only in patients with lymph node metastasis,™*" 

A large number of subsequent studies also attempted to correlate c-er&B*2 
activation with prognosis (Table 5). In 12 series, there was a correlation be- 
tween c-erbB'Z activation and tumor recurrence or decreased survival. In five 
of these series^ the predictive value of o-er&B-2 activation was reported to be 
independent of other prognostic iactorSf-In contrast^ 18 seizes did not-confirm 
the correlation of c-er&B*2 activation with recurrence or survival, fbur possible 
explanations for this controversy are discussed below. 

One problem is that c-erbh-i amplification . correlates with prognosis 
mainly in patients with lymph node metastasis. As summarized in Table 5, most 
studies of patients with axillary lymph liode metastasis showed a correlation of 
C'erbB-^ activation with poor outcome. In contrast, most studies of patients 
without axillary metastasis have not demonstrated a correlation with patient 
outcome. Table 6 summarizes the studies in whidi all patients (with and with- 
out axillary metastasis) were considered as one group. There is a trend for 
studies with a higher percentage of metastatic cases to show an association 
between c-er&B*2 activation and poor outcome. Ibus, most of the current 
evidence suggests that e-er&B"2 activation has prognostic value only in patients 
with metastasis to lymph nodes. 
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TABLE 6, CORRELATION OF e-er bB4 ACTTVATKON WITH OUTCOMB IN PATIENTS 
WITH BREAST CARCINOMA 



Number or Patlanto 



Whh 

TVpeof • MteafBslblo 





o^B-2 
Activation^ 


rofer 


AMUIary Ho 
Lymph No{Se9 Metdsfssfs 


Statlatleal 
Analyatoo 


Reference 


<0,05 


DNA 


176 




M 


Of 


<b.05 


DNA 


61 






« 

OV 


<0.O5 


DNA 


57 




II 
w 




<ao5 


DNA 


41 

* 




11 




<ao5 


mRNA 


62 




y 


v9 


<0.05 


Protein 


■102 


* 

• *^ 




mi* 


<0,O5 


DNA 




345 


M 


01 


<0.05 


DNA 




120 


u 

w 


If 


<0.05 


DNA 




91 


u 


Of 


<0.O5 


DNA 






M 


fo. 


<0.05 


PfQtoin-WB 




350 

i 


M 


OD 


<0.05 


Protdin 




62 4d 


U 

w 




0.05-0.15 


DNA 


67 


■ 




ill 

111 


0.05-0,15 


Protein 


189 




M 


82 


0.0&-0.1S 


Protein 




120 


U 


86 


>0.15 


DNA 


130 




U 


113 


>0.15 


DNA 


122 




M 


4 


>ais 


DNA 


50 




U 


44 


>6.15 


mRNA 


57 




u 


50 


>0.15 


Protein 


290 




M 


86 


>0.15 


Protein 


195 


• 


U 


11 


>0*16 


Protein 


102 


• 


u 


39 


>0.15 


Protein 




137 


u 


17 


>0,15 


DNA 




1B1 


M 


81 


>0.15 


DNA 




- - 169 . 


U 


f/ 


>0.15 


DNA 




73 


u 




>0.15 


Proteln-WB 




378 


u 


85 


>0.16 


Proteln-WB 




192 


u 


17 


>0.15 

• 


Protein 




141 


u 


86 


>0.15 


Protein 




41 


u 


40 



The ondpotnts of Ihess studies were tumor recurronce or damased suivlvol or both. ComBlatlon between o- 
erbB-z octVaUon and a poorer pallent outcome Is elatlsficaUy slentflear)! oi <0.0S, Is o1 equivocal el^ntftcanoe 
at 0.05 to 0.16. and la not aignltteam al >o:i6. 

^Shown as vartable measured. Letiera 'V/B" (ndlceOe assay by Western blot; the other protein atudlea used 
rmmunoMstochemical methods. 

mutltvarlate statistical artaly&ls; U » untvarlate statisUoal analyala. 
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«L^fi5P^^®^ BREAST CARCINOMAS WfTH METASTASIS COMPARED 
WITH PROGNOSTIC SIQNIFICANCE OF 0^59-2 ACTIVATION 



% of tumors With 
lymph node 

metastasis In 
eech study 



70 



80- 



50- 



40 



71 (DNA)» 



61 (DNA)» 
69 (DNA)^' 

58(Proteln)"' 



64(DNA)i" 



42(P/otein)>2 



P<0.05 



4 



0.05<P<0-16 



64 (mRNA)i 
61(DNA)^ 



68(ONA)« 
57 (DNA)»^» 
55(Prololn)» 

4a(Prolefn)i» 
46(Prolefn)^ 



H 

p>o:i5 



P tor corrolallof) of o-erbS^ acllvatlan with pallent outcojne. 

Each stud/s percentage of breast carcinomas wmi irotastaHs fa compared wHh the oorrelaSon between 
ertB-2 acUvatfon and outcome. These data IncJude only dK)se studies Ihatconstdared, as onegnMjp, bB breast 
cancer patterns, whether or not they had axitlary metastasis. SupeiscrtptB are the rBlerenoa& In pvoiihoses 
are the types or c-er68-2 activation. P values are Interpreted as In Table 3. 



A second problem is that various types of breast carcinoma are grouped 
togetiier in many survival studies. Because the ciixmit litjemture suggests that 
c-er&B-2 activation is infrequent in lobular carcinoma, studies that combine 
infiltrating ductal and lobular carcinomas may dilute the prognostic eflfect of o- 
er&B-2 activation in du5*al tumors. In additian^ most studies do not analyze 
inflammatory breast carcinoma separately. This condition frequently shows c- 
ertB-2 activation and has a worse prognosis than the usual mammary carci- 
noma, but it is an uncommon lesion. 

A third potential problem is the paucity of studies that attempt to correlate 
c-^foB-2 activation with clinical outcome in subsets of breast carcinoma widiout 
metastasis. TWo recent abstracts reported that in patients without lymph node 
metastasis who bad various risk factors for recurrence (such as large tumor size 
and absence of estrogen receptors), c-er&B-2 overexpresslon predicted early 
recurrence, «»w In patients with ductal carcinoma in situ, one small study found 
no association between tumor recurrence and CHerbB-2 activation.*? 

A fourth problem is the laclc of data regarding whether the prognosis 
correlates better Mith c-erfeB-2 DNA amplification or with mRNA or protein 
overproduction. Most studies that find a correlation between c-erfcB-2 activa- 
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tion and poor patient outcome measure c^rbB'2 DNA . amplification (Table 5). 
and breast carcinoma patients with greater amplification of may have 

poorer suiYivaL'^»« Recent studies suggest diat ampllBcation has more prognos- 
tic power than ov©rproductfon,i7.M,35 but the clinical si^iflcance of e-0rfeB^2 
overproduction without DNA amplificatipn deserves fiirtlier i^eanA.^w Few 
studies have attempted to correlate patient outfcbme widi o-erfcB-2 mBNA 
overproduction, and many studies of c-erfcB-2 protein overproduction use rela- 
tively less reliable metliods such as immunohistochemical studies with poly- 
clonal antibodies. 

t 

V 

Comparison of Activation With Other Oncogenes In 

Breast Carcinoma 

Other oncogenes that may have prognostic implications iniiuman breast cancer 
aure reviewed elsewhere, This section will be restricted to a comparison 
between the clinical relevance of c-erbB-Z and diese other oncogenes. 

The c-mjfc gene is ofien activated in breast carcinomas, but c-myc activa- 
tion generally has less prognostic importance than c-irfcB-2 activation, ^•'^".w.w 
One study found a correlation between increased mRNAs of c-erbB-2 and c- 
myc, although other reports have not confirmed diis.^i<» Subsequent research, 
however, could demonstrate a subset of breast carcinomes in which c-myc has 
more prognostic importance than c-erfcB-2. 

The gene c-eriB-1 for the epidermal growth factor receptor (EGFB) is 
homologous with c*erbB-2 but is infrequently amplified in breast carcinomas. 
Overproduction of EGFR, however, occurs more fcequendy than amplification 
and may correlate with a poor prognosis. In studies that have examined both 

and EGFR in the same tumor, c-erbB-l has a stronger correlation with 
poor prognostic factors »»» Studies have tended to show no correlation between 
amplification of c^r&B-2 and c^erfcB-1 or oveiproductionof c-6r6B-2 and EQFR, 
although at the molecular level EGFR mediates phosphorylation of c^riB-2 
protein.«.W'«i.M.ioo Recent reviews describe EGFR in breast carcinoma, Aioo 

. The genes c-efbA and ear-l are homologous to the thyroid hormone recep- 
tor, and they are located adjacent to'^dterfeB* on chromosome 17. These gene^ 
are frequently coamplified with c-er&B-2 in breast carcinomas. Hie absence of 
c-erfcA expression in breast carcinomas, however, is evidence against an impor- 
tant role for this gene in breast neoplasia.^ Amplification of c-erfeB-2 can occur 
without ear-l amplification, and these tumors have a decreased survival that is 
similar to tumors with both c-0rfeB.2 and ear-l amplification." Consequently. 

c-ertB-2 amplification seems to be more important than amplification oio^erbA 
or ear-l. 

Other genes also have been compared with c-er&B-2 activation in breast 
carcinomas. One study found a significant correlation between increased o-erbB- 
2 mRNA and increased mRNAs of/os, platelet-derived growth factor chain A, 
and Ki-ro^.iw Allelic deletion of c-Ha-roy may indicate a poorer prognosis in 
breast carcinoma." but it has not been compared with c-er6B-2 activation. Some 
studies have suggested a correlation between advanced stage or recurrence of 
• breast carcinoma and activation of any one of several oncogenes.^*u3 
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ACTIVATION OP e-erbB-2 IN NON-MAMMARY TISSUES 

Ineicience of c-erbB-a Activation in Non-Mammaiy Tissues 

Table 7 summarizes the normal tissues in v^ich o-er&B-2 expression has been 
detected, usually with hnmuiiiohistocheinic&l methods using polyclonal anti- 



TABLE 7. PRESENCE OR ABSENCE OF mRNA OR e-eri^B*2 PROTEIN IN 

NORVHAL HUMAN TISSUES 



Ttfisues With 
o^f)bB-2 
mRNA 



Sklrk>4 



Stomach^* 

• 

Jejunum" 

Colon** 

Kidneys 



Lung« ^ 



Feul brain^^ 

Thyroid* 
Uterus^ 



Placenta^ 



Tissues Produelns 
c-e/t»B»2 Protein^ 



Tissues Lacking 
mRNA 



Tlssuss Lacking 
e^rbM Protein 



Epktemilsw .. 
Extemai root sheath^ 
Eoortns sweat gland^ 

Fetal oral mucosa^ 
Fetal esophagus^ 
Stomach2W8 
Fetal IntsGtlnefvi* 
Small mtestlne*^*^ 
Colonwa 

Fetal kidney«» 

FetaJ proximal tubuie"^ 
Distal tubule^ 
Fetal collecting ductus 
Fetal renal pelvis" 
Fetal ureter^ 

Hepalocytes^ 
Pancreatic acin'P 
Pancreatic ducis**'^ 
Endocrine cells of islets 
of Langertians^ 

Fetal trachea®. 
FetaJ bronchioles^ 
Bronchioles^ 



Fetal ganglion cells'^ 



Ovaiy'* 

Blood veasels^ 



Kidney&io< 



Postnatal ord mucosa<> 
Postnatal esophagus^ 



Qbrosrulusf' 

Postnatal Bowman's capsule^ 
Postnatal proximal tubule^ 

Postnatal collecting duct^ 
Postnatal rensl pelvis^ 
Postnatal fetal uretei^ 

Uvei«» 



Pancreatic Islets^ 

Postnatal trachea^ 
Postnatal brcmchtoles^ 

Postnatal alveoli^ 

Postnatal braln« 
Postnatal ganglion ceils^ 



Endothelium"' 

Adrenooortioal cells^ 
Postnatal thymus^ 
Rbroblasts" 
Smooth muscle catls^ 
Cardiac muscle celts^ 



This protein study used Weslem bloto; tha rest used {mmunohlatochomical methods. 
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bodies. Only a few studies have been perfonned, and some of these do not 
demonstrate convinctng ceU membrane reactivity in the published photo* 
graphs. The interpretatioas in these studies, however, are listed^ with the 
caveat that tiiese findings should be confirmed by imlnnnopfecipitation or 
Western or RN A blots. Production of oerbB-^Z has been identified in normal 
epithehum of the gastrointestinal tract and skin. Discrepancies regarding c- 
,erbB-2 protein in other tissues could be due, at least in part, to difierences in 
techniques. 

The data on c-er&B-2 activation in various non-mauun.ary neoplasms 
should be interpreted with caution, because only small numbers of tumors have 
been studied, usually by immunohistochemical methods using polyclonal anti- 
bodies. Studies vsing cell lines have been excluded, because ceQ culture can 
^ induce ampUficatioh and overexpression of other genes» although this has not 
' been documented for c-er6B"2. ' " 

AcUvation of c-er&B-2 has been identified in 32 percent (64 of 203) of 
ovarian carcinomas in eight studies (Table 8). One abstract*^ stated that ovarian 
carcinomas contained significantly more o-er2iB-£ protein than ovarian non- 
epithelial malignancies. Another report^i showed that 12 percent of ovarian 
carcinomas had c-er6B-2 overproduction without amplification. 

Activation of o-erfeB-2 has been identified in 20 percent (40 of 198) of 
gastric adenocarcinomas in seven studies^ including 33 percent (21 of 64) of 



TABLE 8. e<eri»B-2 ACTIVATION IN HUMAN GYNECOLOGIC TUMORS' 















mRNA 


Protein 




c^rbB-l DNA 


Ovw 


Ovcr- 


Tumor Type 


Amplification 


produ«:ti6n 


produelfon 


Ovary — carcinoma, not oUienvlse 


91/1 20 « 1/11,5^ 


2a/6r« 


23/73,« 


specifiad 


O/S, w 0/6,«* 0/3»^« 


38/7291 




0«,»Q/1"« 




■ 


Ovary— serous (papUlaiy) carcinoma 


2/7,i«1/7,>«0/6« 






iOvary-— endomatrtoidcardnonna- * 






f 


Ovary— mudnous carcinoma 








Ovary-^^lear cell carcinoma 


0/2,^«0^^ 






Ovary— mixed epiihelial carcinoma 








Ovary— endomeWoW borderClne tumor 


0/172 






Ovary— mudnouB borderline tumor 


oy3« 






Ovary — serous oyatadenoma 

■ 








Ovary— mucinous cyatadenoma 


0«7» 






Ovajy-*^sciaroslng atromal tumor 


on« 






Ovary— fibrothecoma 


on" 






Uterus — endometrial adenocarcinoma 


0/4 M 0/1 ♦•o 







•Shown as number o! cases with amplincatlon (or overproducllon)/tolal number of cased studied; reference te 
given as supersoript All protein studies used Immunohistochemtcal methods. 
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intestinal or tubular subtypes and 9 perpent (4 of 47) of difiiise or signet ring cell 
subtypes (Table 9). ActivaHon of o-erfcB-2 has been detected in 2 percent {6 of 
281) of colorectal carcinomas, although an additional immunohistochemical 
study detected c-er&fr-2 protein in seven of eight tissues fixed in Bouin s solu- 
tion. One study found greater immunohistochemical reactivity for o-ariB-2 
protein in colonic adenomatous polyps than in the adjacent normal epiflielium^ 
using Bouin s fixative. Lesions with anaplastic features and progression to inva- 
carcinoma tended to show decreased immunohistnchemical reactivity for c- 
ct*B^2 protein « Hepatocellular carcinomas (12 of 14 cases) and cholauglocaroi- 
nomas (46 of 63 cases) reacted with antibodies against c^bB-2 in one study, but 
some of these "positive" cases showed only difiiis^ cytoplasmic staining, which 



TABLE 9. f>erbM ACTIVATION IN HUMAN QASTHOINTESTINAL TUMORS" 



Tumor type 

Esophagus— squamous cell carcinoma 
Stomach— carcinoma, poorly differentiated 
Stomach — aderwoardnoma 

Stomach— cardnoma, tntestina) or tubular type 
Stomaoh-Hmrclnoma, diffuse or signet ring cell type 
Colorectum— carcinoma 



Colon— villous adenoma 
Colon— tubulovillous adenoma 
Colon — tubular Sonoma 
Colon— ^hyperplastic polyp 
Intestine— letomyosarcjpma 
Hepatocellular carcbtoma — 
Hepatoblastoma 
Ctiolanglocardnoma 
Pancreas— adenocarcinoma 
Pancreas— acinar carcinoma 

■ 

Pancreas— dear cell cardnoma 
Pancreas— large cell carcinoma 
Pancreas— signet ring cardnoma 
Pancreas— chronic Inflammation 



e^B-2DNA 

Amplification 

0/1 w 

2/24 « 2/9 w 2/8,^i< 
2/8.W O/I^M 

5/10i« 

0/21W 

1/45,w.l/45,» 
0/40," 0/32,"' 0/3W 

0/5W 
0/7" 
0/1« 



Q/12"< 

0/15^. 



Protein 
Over- 
production 



0/1 



SI 



4/27 » 3/1 0«t 

16/549 
4/4S® 

1/22,"7/Q=» 



t9/10aM> 

12/14?sQ/j26i 

46/63» 

2/B0»<^«» 0/2«< 
0/1 « 
0/2^' 
0/3^1 
0/1« 

0/14*^0 



•Shown as number of cases with amplification (oi overproducUon)/lotal wmtwr ol caseo slwftwl; roleronce la 
given as supewcripL AU protein studies used ImmunoWstochemlcal methods. No stucflee analyzed foi e^/feB- 
2 mRNA. — 

*TlssuBs fixed In Bouln'e Gotutlon. 

cQnly cases with distinct membmne staining are Interpreted as showing c-ert^B-a overproducUon. 
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TABLE 10, e*crbB-2 ACTIVATION IN HUiiAAN PULMONARY TUMORS* 



• 




c-erOB«2 


Tumor Typo 


0^B-2 0NA 


Piotein 


AmplHlcatlon 


Overproduction 


Non-8mal) oafl carcinoma 


2/60/6 0^60^1 


i/e4» 


Epidermoid cardnoma 


0/13 « 0/10,"' 0/6» 


3/S" 


Adenocarcinoma 


0/2t,« 1/13," 0/7,?« 0^,67 0/3W 


4/12» 


Large ceil caroinoma 


0/9 « 0/^ 


« 


SmaD cell carcinoma 


■ 


0/26,na/3^ 


Carcinoid tumor 


0/1« 


.0/3» 



. "Shown as nuinber of oases with ampOBoaHon (or overprodiidlooVlofcErf numbef of cases stucRedj referonce Is 
given as eupeiBorfpl Mi protein studies used Imniunohlstochemlcal msthods. No studies anslyzsd for o«/AB* 
2 mRNA. 



does not indicate c-erbh-Z activation in breast neoplasms," Also, some pancre- 
atic carcinomas and chronic pancreatitis tissue had cytoplasmic immuiiohisto- 
chemical reactivity for Q-er&B*2 protein, in addition to the rare case of pancre< 
atic adenocarcinoma with distinct cell membrane staining.^ 

Tables 10 through 14 summarize the studies of c-srbB-2 activation in other 
neoplasms. The oerbB'2 oncogene is not activated in most of these tumors. 
Activation of oerbB-Z has been detected in 1 percent (4 of 299) of pulmonary 
non-small cell carcinomas in nine studies, although one additional report 
found c*erliB-2 protein overproduction in 41 percent (7 of 17), Renal cell carci- 
noma had c-eriiB-2 activation in 7 percent (2 of 30) in four studies. Overproduce 
tion of c-er&B*2 protein was described in one transitional cell carcinoma of the 
urinary bladder, a grade 2 papillary lesion." Squamous cell carcinoma and basal 
cell caroinoma of the skin may contain c-0r6B-2 protein, but H is not clear 

TABLE 11, frerl>B>2 ACTIVATION IN HUMAN HEMATOLOGIC PROLIFERATIONS^ 



• 

• 

> 

* * 

Tumor Type 


* 

- c.eriiB-2 DNA 
Amplification 


c-ertiB-2 

ipRNA. 
Over- 

produclton 


c-en)B-2 
Prorlein 
dver« 
production 


Hematologic malignancies 


0/23<" 






Malignant lymphoma 






0/1 5« 


Acute leukemia 

• 


0/1 




* 


Acute lymphoblastic leukemia 


0/1 w 






Acute myeloblastio leukemia 


0/3W 






Chronic leukemia 


0/185^ 




• 


Chronic lymphocytic leukemia 


0/6tw 






Chronto myelogenoue leukemia 






• 


Myeloproliferative disorder 


0/1 ST 







•Shown as numbar of oa^es wHh ampHIIcatlon (or overproductlonVtoiaJ ngmbor of oases studied; referenoe Is 
given as siiperscrtpf . All protein studies used Immunohlstochemica} methods. 
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TABLE 1Z o€rbM ACTIVATtOW IN HUMAM TUMORS OF SOFT TIS SUE AND BONE- 

DNA 



. TUmorType Amplfflcation 

Sarcoma o/io»i^i o/a^' 
Malignant fibrous hlstiooytoma 0/1 <^ 

Uposarooma a^i<nr 

Pleomorphic fiorconrta Q/V^ 

Hhabdomyosarooma Q/V^ 

Odteogenlo sarooma 0/2,>i>'0/2^ 

Chor\drosarcoma q/iw 

Ewing*s sarcoma Q/1 67 

Sohwannoma 0/1^ 



"Shown as number of cases with amplification (or overpradueikm)^! numtor of cases studied; reference Is 
afven as supefsciIpL No studies analyzed for 0^6*2 mRNAor ^mbB-Z pioteln. 

whether the protein level is increased over that of normal sldii." Thyroid 
carcinomas and adenomas can have low levels of increased c-erfeB-2 mRNA. 
One abstract, described low-level c-er6B-2 DNA amplification in one of ten 
salivary ^and pleomorphic adenomas.-^ 

Correlation of c-erbB>-2 Activation With Patient Outcome 

Very few studies have attempted to correlate c-erfcl^2 activation in non- 
raammaiy tumors with outcome.* Slamon et al« showed that c-erbB*2 ampliflca* 
tion or overexpression in ovarian carcinomas correlates with decreased survival, 
especially when marked activation is present. However, they did not report the 
stage, histological grade, or histological subtype of diese neoplasms. Another 
study of stages III and ly ovarian carcinomas found a conelation between 
decreased survival and c-er2'B-2 protein overproduction, but not between sur- 
vival and histological grade. i« One abstract stated that G-er27B-2 protein oveipro- 
duction in 10 of 16 pulmonary adenocarcinomas correlated with decreased 
disease-free interval.'"* Another ab.sttap.Ldescribed atendenc^ for inununohisto- 



« 

TABLE 13. 0-erlyB*2 ACTIVATION IN HUMAN TUMOBS OF THE URINARY TRACT* 



• 

Tumor Type 


• 

o«rbB-2 DNA 

Amplification 


mRNA 
Over* 
production 


Protein 
Over- 

produetlon 


Kidney— renal ceil carcinoma 


VS,^ 1/4,iw 0/5M 


0/16'M 


• 


Wilms' tumor 






g _ _ 


Prostate— adenocarcinoma 






0/23» 


Urinary bladder^^wrdnoma 






1/48» 



•Shown as number of cqbob with empllf Icallon (or overproduci1on)/lota) number of cases studied; referenoe Is 
given aa superscript. All protein eiudtes used Immunohlstochefiiical methods. 
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TABLE 14. <wM>2 ACTIVATION IN MISCELUNEOUS HUR/IAN TUMORS* 



Tumor Type 

SKlrwrnaSgnent melanoma 
Skin, head and nack-^squamous 
cell carcinoma 

Site not stated— squamous cell 
carcinoma 

Salivary gland— adenocarcinoma 
Parotid gland— adenoid cyalic 
carcinoma 

- ThynM^naplastk} carcinoma 
Thyroid— papillary carcinoma 
Thyroid— eidenooarclnoma 
Thyroid — adenoma 
Neuroblaetoma 
Monlngk)ma 



DNA 

Amplification 

0/8," 
1/1 » 



e-erM-2mRNA 
Ovarproduotfon 



Protein 
Over- 
production 

0/10» 



0/1« 



0/1 « 
0/5» 
0/1« 



30OW levets)/5f 
1(bw level&)/2i 



•Shown 85 number of cases with amplincatlon (or overproducUon)/UrtaJ number cl cases sludM refereitoe to 
given as eupersoitpt All protein studies used Immunoh&tocheirioal melhods. 

« 

chemical reactivity for ceriB^g protein to correlate with higher grades of pros- 
tatic adenocarcinoma.^' Additiorial prognostic studies of ovarian parcinomas axid 
other neoplasms are needed. 



■ 



SUMMARY 

Activation of the oncogene can occur by ampUBcation of o-effcB-2 

DNA and by overproduction of mRNA and WfcB-.2 protein. Approxi- 

mately 20. percent of breast carcinomas shovr, ??yidence of «o-er&BC2 activation, 
which correlates with a poor prognosis primarily in patients with metastasis to 
axillary lymph nodes. Studies that have attempted to correlate c.cr6B-2 activa- 
tion with other prognostic factors in breast carcinoma have reported conflicting 
conclusions. The pathologic and clinical significance of cerfeB-2 activation in 
other neoplasms is unclear and should be assessed by additional studies. 
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P.O. Box 1450 
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I, J. Christopher Grimaldi, declare and say as follows: 

1 . I am a Senior Research Associate in the Molecular Biology Department of 
Genentech, Inc., South San Francisco, CA 94080. 

2. I joined Genentech in January of 1999, From 1999 to 2003, 1 directed the Cloning 
Laboratory in the Molecular Biology Department. During this time I directed or performed 
numerous molecular biology techniques including flualitative Polymerase Chain Reaction (PCR) 
analyses. I am currently involved in, among other projects, the isolation of genes coding for 
membrane associated proteins which can be used as targets for antibody therapeutics against 
cancer. In connection with the abbve-identified patent application, I personally performed or 
directed the semi-quantitative PCR analyses in the assay entitled "Tumor Versus Normal 
Differential Tissue Expression Distribution" which is described in EXAMPLE 18 in the 
specification that were used to identify differences in gene expression between tumor tissue and 
their normal counterparts. 

3. My scientific Curriculum Vitae, including my list of publications, is attached to 
and forms part of this Declaration (Exhibit A). 

4. In differential gene expression studies, one looks for genes whose expression levels 
differ significantly under different conditions, for example, in normal versus diseased tissue. 
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Dear Sir: 
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Chromosomal aberrations, such as gene amplification, and chromosomal translocations are 
important markers of specific types of cancer and lead to the aberrant expression of specific 
genes and their encoded polypeptides, including over-expression and under-expression. For 
example, gene amplification is a process in which specific regions of a chromosome are 
duplicated, thus creating multiple copies of certain genes that normally exist as a single copy. 
Gene under-expression can occur when a gene is not transcribed into mRNA. In addition, 
chromosomal translocations occur when two different chromosomes break and are rejoined to 
each other chromosome resulting in a chimeric chromosome which displays a different expression 
pattem relative to the parent chromosomes. Amplification of certain genes such as Her2/Neu 
[Singleton e^ a/., Pathol, Annu. , 27Ptl: 165-190], or chromosomal translocations such as t(5;14), 
[Grimaldi e/ a/.. Blood, 73(8):208 1-2085(1 989); Meeker et al. Blood , 76(2):285-289(1990)] give 
cancer cells a growth or survival advantage relative to normal cells, and might also provide a 
mechanism of tumor cell resistance to chemotherapy or radiotherapy. When the chromosomal 
aberration results in the aberrant expression of a mRNA and the corresponding gene product (the 
polypeptide), as it does in the aforementioned cases, the gene product is a promising target for 
cancer therapy, for example, by the therapeutic antibody approach. 

5 . Comparison of gene expression levels in normal versus diseased tissue has 
important implications both diagnostically and therapeutically. For example, those who work in 
this field are well aware that in the vast majority of cases, when a gene is over-expressed, as 
evidenced by an increased production of mRNA, the gene product or polypeptide will also be , 
over-expressed. It is imlikely that one identifies increased mRNA expression without associated 
increased protein expression. This saine principle applies to gene under-expression. When a 
gene is under-expressed, the gene product is also likely to be xmder- expressed. Stated in another 
way, two cell samples which have differing mRNA concentrations for a specific gene are 
expected to have correspondingly different concentration of protein for that gene. Techniques 
used to detect mRNA, such as Northern Blotting, Differential Display, in situ hybridization, 
quantitative PGR, Taqman, and more recently Microarray technology all rely on the dogma that a 
change in mRNA will represent a similar change in protein. If this dogma did not hold tme then 
these techniques would have little value and not be so widely used. The use of mRNA 
quantitation techniques have identified a seemingly endless number of genes which are 
differentially expressed in various tissues and these genes have subsequently been shown to have 
correspondingly similar changes in their protein levels. Thus, the detection of increased mRNA 
expression is expected to result in increased polypeptide expression, and the detection of 
decreased mRNA expression is expected to result in decreased polypeptide expression. The 
detection of increased or decreased polypeptide expression can be used for cancer diagnosis and 
treatment. 

6. However, even in the rare case where the protein expression does not correlate 
with the mRNA expression, this still provides significant information useful for cancer diagnosis 
and treatment. For example, if over- or under-expression of a gene product does not correlate 
with over- or under-expression of mRNA in certain tumor types but does so in others, then 
identification of both gene expression and protein expression enables more accurate tumor 
classification and hence better determination of suitable therapy. In addition, absence of over- or 
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under-expression of the gene product in the presence of a particular over- or under-expression of 
mRNA is crucial information for the practicing clinician. For example, if a gene is over-expressed 
but the corresponding gene product is npt significantly over-expressed, the clinician accordingly 
will decide not to treat a patient with agents that target that gene product. 

7. I hereby declare that all statements made herein of my own knowledge are true and 
that all statements made on information or belief are believed to be true, and further that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 1 8 of the United States 
Code and that such willful statements may jeopardize the validity of the application or any 
patent issued thereon. 

By: f I ._ Date: ^ 

i. Chnstopher Grimaldi 

S:\DOCS\AOK\ApiC-5479.DOC 
071904 ' 



EXHIBIT 3 



DECIJ\RATI0N OF PAUL PpLAKIS. Ph.D. 

I, Paul Polalds,Ph,D., declare and say as follows: , 

1.1 was awarded a Ph.p. by the Department of Biochemfetry of the Michigan 
St^e Umvemly in 1984. My scientific Curriculum Vitae is attached to and fonns 
part.of this Declaration (Exhibit A). 

■ 

2. I am currently employed by Genentech, Inc. where my job title is Staff 
Scientist^ S mce joining Genentech in 1999, one of my primary responsibilities has 
been leadmg Genentech's Tumor Antigen Project, which is a large research project 
witii a primary focus on identifying tumor cell markers that find use as targets for 
both the diagnosis and treatment of cancer in humans. 

3. _ As part of the Tumor Antigen iProject, my laboratory has been analyzing 
dmerential expression of various genes in tumor cells relative to normal cells 
The purpose of this research is to identify proteins, that are abundantly expressed 
on certam tumor cells arid that are either (i) not exl)ressed. or (ii) expressed at 
lower levels, on corresponding normal cells. We caU such differentially expressed- 
proteins tumor antigen proteins", When such a tumor antigen protein is 
ideiitified, one can produce an antibody that recognizies and binds to that protein 
Such an antibody finds use in the diagnosis of human cancer and. rhay ultimatelv 
serve as an effective therapeutic in the treatment of human cancer. 

4. In .the course of the research conducted by Genentech's Tumor Antigen 
Project, we haive, employed a variety of scientific techniques for detecting and 
studymg differential gene expression in human tumor cells relative to normal cells 
at genomic DNA, mRNA and protein levels. An important example of one such ' 
techruque is the well known and widely used technique of microarray analysis 
whjch has^roven to be extremely usefiil for the identification of mRNA moleevSles 
that are differentially expressed in one tissue or cell type relative to another In the 
course of our research using microarray analysis. We have identified 
approximately 200 gene transcripts that are present in human tumor cells at 
significantly higher levels than in corriesponding normal human cells. To date we 
have generated antibodies that bind to about 30 of the tumor antigen proteins ' ' 
expressed from these differerltially expressed gene transcripts and have used these 
antibodies to quantitatively determine the level of production of these tumor 

^ - * ^^^^^^ 

coitesponding normal cells. We 
have then compared the levels pf mRNA and protein in both the tumor and normal 
cells analyzed. 

5 From the mRNA and protpin. expression analyses described in paragraph 4 
above, we have observed that there is a strong correlation between changes in the 
level of mRNA present in any particular cell type and the level of protein 



e?qjressed from that mRNA in that cell type. In appix)xiinately 80% of our 
observ^ons we have found that increases in thei level of a particular iniElNA 
correlates with changes in the level of protein expressed from tiiat mKNA v;4ien 
human tumor cells are compared with their correspoiiding noiinal cells. 



knowledge of the relevant scientific literature, it is my considered scientific 
. opinion that for human genes, an increased level of roRNA in a fumor cell relative 
. to a normal ceil typically. C(»Tdates to a similar increase in abundance of the 
encoded protein in the tumor cell relative to the normal cell. In fact, it remains a 
central dogma in molecular biology that increased mRNA levels are predictive of 

con^esponding increased levels of the. encoded protein. . While there have been 
published reports of genes for which sudi a correlation dioes not exist, it is. my 
ojpinidn that such reports are exceptions to the commonly understood general rule 

that increased mRNA levels are jpi-edictive. of corresponding increased levels of the 
encoded protein. 

* 

• * ■ 
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7. I hereby declare that all stateinents made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be true, 
and fiuther that these statements were made with the knowledge that.willfiil felse 
statements and the like so made are puipshable by fine or impris^^oment, or both, 
under Section lOOl of Title 18 of the United States Code and that such willAU 
statements may jeopardize the validity of the application or any patent issued 
fliereon. 
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High-throughput technologies, such as proteomic screening and DNA micro-arrays* produce vast 
amounts of data requiring comprehensive analytical methods to decipher the biologkatly relevant 
results. One approach would be to manually search the biomedical literature; however, this would be 
an arduous task. We developed an automated llterature-mfnlrig tool, termed MedGene* which 
comprehensively summarizes and estimates the relative strengths of all human gene--disease 
relationships in- Medline. Using MedGene. vy^ analyzed a novel mlcro-array expression dataset 
comparing breast carKer and normal breast tissue in the context of existing knowledge. We found no 
correlation t>etween the strength of the literature association and the magnitude of the differerwe in 
expression level when considering changes as hi^ as 5-fotd: however, a significant correlation was 
observed (r.= 0.41; p = 0.05) among genes showing an expression difference of 10>fold or more. 
. Imerestlngly, this only held true for estrogen receptor (ER) positive tumors, not ER negative. MedGene 
Identified a set of relatively understudied* yet highly expressed genes in ER negath/e tumors worthy of 
further examination. 

Keywords; bk>inroTm3tics • micro-array » text mining • gene-diisease association • t^reasl cancer 



Introduction 

Al its current pace. Uic accumulalion of biomedical Literature 
outpaces the ability of most researchers and cllnlclaiis lostay 
abreast of Uiclr own iniincdiatc Odds, lot alone cover a broader 
range of topics. For example, to follow a single disease. e.g.. 
breast cancer, a rcseardier would have had to scan 130 dilTcrcnt 
journals and read 27 papei3 per day in 1999.» This problem Is 
accentuated wltJi hlgh-Uirougliput teclinologjcs such as DNA 
micro-arrays and protconiJcs. which require Ihe analysis of 
large datascls involving tlwusands of genes, many of which arc 
unfamiUar (o a particular researcher. In any microarray experi- 
ment, thousands of genes may dcmonsirale statisUcally sig- 
nificant expression changes, but only a fracUon of these may 
be relevant to the study. The ability to interpret these dalasets 
would be enhanced if they could be compared to a compre- 
hensive summary of wliai Is known about all genes. Thus, there 
is a need to summarize existing knowledge In a format that 
allows for flic rapid analysis of associations between genes and 
disea.sc.'s or otlwr specific biological cojKCpts. 

One solution to this problem is to compac structured distal 
resources, suclt as the Breast Cancer Ccne Datat>ase* and the 
Tumor Ceiic Database.' Mowcvcr. as these resources arc liaiul- 
curntcd. tlic labor-intensive review process becomes a rate^ 
liiniling step in tl>e t>rowlIi of tJ>e database. As a result, these 
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databases have a limited scale and tlic genes arc not selected 
In a systematic fashion. 

An allemallvc approach Is automated text mlr\ing; a method 
which involves atilomated infonnatlon extraction by searching 
documents for text strings and analyzing their frequency and 
context This approach has been used successfully In several 
instances for biological applications. In most cases. It lias been 
applied to extract information about the relationships or 
interactions that proteins or genes liave witl) one another, in 
Uie literature or by functiona] annotation.'*^ Thus far. few 
publication have applied text-mining to examine the global 
relationships between genes and diseases. Perez-Iratxeta etal. 
automatically examined ilie CO (Gene Ontology) annotation 
of genes and their predicted chromosomal locations in order 
to identify genes linked to Inherited disorders.' 

To obtain a more global understanding of disease develop- 
ment, it would be valuable to Incorporate infonmation rc^uding 
all possible gcncdlsease relationships. Including biochemical, 
physiological, pharmacological, epidemiological, as well as 
genetic. This Infomiation would enable compreliensivc com- 
parisons tietwecn large experimental datasots and existing 
iuiowledge fa) the literature. Tills would accomplish two things. 
First, it would serve to validate experiments by demonstrating 
tiiat known responses occur as predicted. Second, it would 
rapidly highlight wlilch genes are corroborated by the llleraturc 
and which genes are novel in o given contcxL Wc liave utilized 
a computational approach to literature mining to produce a 
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comprehensive set of gene-diseasc relationships. In addition, 
we have developed a novd approach to assess the strength of 
each association based on the frequency of dlatkm and co- 
dtaUon. We applied tliis tool to help Interpret the data (ram a 
large micro-array gene expression experiment comparing 
nonnal and cancerous breast tissue. 

Methods 

MedCcne Database. McdCene Is a relational database, stor- 
ing disease and gene information from NCBI, text mfaiing re- 
sults, statistical scores, and hyperiinks to the primaiy Ut* 
crature. McdCcne has a web-based user interface for users lo 
queiy itie database (http://hipseqjnedi)arvatd.edu/MedGenc/). 

Text Mining Algorithms. MeSH files were ii|pwnloaded from 
theMeSHwebslleatNUI (Nation Ubnuy of Medldne) (http:// 
www.nbnjnih.gov/inesh/fneshhomeJilinI) and human disease 
categories were selected. Lcx:u5Unk iUes were downloaded from 
the LocusLink web site at NCBI (hUpy/www.ncbl. nlh.gov/ 
LocusUnk/). OfHcial/preferFed gene symbol ofljclal/prefefied 
gene name, and gene alternative symbols and names, all 
relevant annotations and XSBLs for each LocusUnk record, were 
collected. Gene search terms were used for literature searching 
and included alt qualified gene names, gene symlmls. and gene 
family terms. Primary gene keys, predombiantly qualiRed gene 
family tenns and gene ofGcial/p referred symbols, were used 
to Index Medline records. If the omdal/prcferred gene symbols 
did not meet (he standards to be an Index, then qualiOed gene 
ofiicial/prcfcned names were used. A local copy of MedUne 
records (up to July, 2002) was pre-selectcd. 

A JAVA module examined the McSH terms and then Indexed 
each Medline record wiUi the appropriate disease tcnns. A 
separate JAVA module was u$«l to cxaiitltic Um; titles and 
absu^cis for gene search tcnns and Uien to index the gene- 
relaicd Medline records wilh the relevant primary gene kcy(s). 

Statistical Methods. For every gene and disease pair, we 
counted records that were Indexed for both gene and disciase 
(double positive hits), for disease only (disease single Wis), for . 
gene only (gene single hits), and for neither gene nor disease 
(double negative hits) to generate a 2 x 2 contingency table. 
On the basis of the contingency tablc-framcwork. wc applied 
difTerenl statistical moUiods to estimate the strcngU> of gene- 
disease relationships and evaluated the results. These methods, 
included chi-square analysis. Flshcr^s exact probat»illties, rela- 
tive rtsk of gene, and relative risk of disease*' (http:// 
})ipscq.mGd.harvardjedu/McdCcne/). In addition, we computed 
the 'product of frequency*, which Is the product of the 
proportion of disease/gene double hits to disease single Wis 
and the proportion of diseasc/gcnc double Wts to gene single 
Wts. To obtain a nonnal distribution, we traruformed all the 
statistical scores using die natural logarillun. Wc selected Ihc 
log of the produa of frequency (LPF) to validate MedCene and 
to use for the analysis with the micro-array data. Spearman 
rank-correlation cocHicients were used to assess the linear 
relationship between I.PF and micro-array fold change in 
expression level. 

Global Analysis. Diseases wltJi at least 50 related genes were 
sclcclcd for clusteriiig analysis, and tJie LPF scores were 
noniiali7.ed wIOi toial score for each disease. Illcmrchical 
clustering was done wilh (he "Cluster" software aiKl the 
clustering result was vlsunllzed using "TrceVicvvcr" (http:// 
rana.ibl.gov/niscnSoftwarc.htm). 
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- Breast Tissue Micro-Arrays. Eighty-nine breast cancer 
saffl|des (79% ER-posltlve) and 7 noraial breast tissue samples 
were selected from tlie Harvard Breast SPORE finozen tissue 
reposlloiy and were representative of the spectrum of lilslo- 
toglcal types, grades, and hormone receptor Immuno-pheno- 
types of breast cancer. Biotlnylaled cRNA. generated fiom the 
total WA extracted from the bulk.ttmior, was hybridized to 
Aff)rmetrix U95A oUgo-nudeotide micro-arrays. Ihese mkro- 
arrays consist of 12 400 piobes» which represent approximately 
9000 genes. Raw expression values were obtained using GENE- 
CHIP software from Afiymctilx. and then lurtlier analyzed using 
the DNA*C3iip Analyzer (dCWp) custom software. 

Results 

Automated Indexing of Medline Records by Disease and 
Gene. To study the gene-disease associations in the literature, 
we flrst compiled complete Usts for human diseases and human 
genes. To index all Medline records that were relevant to 
human diseases^ the Medical Subject Heatfing (MeSH) index 
of Medline records was utilized. MeSH is a controlled medical 
vocabulary from the National Library of Medicine and consists 
of a set of terms or subject headings that are arranged in both 
an alphabetic and an hierarchical structure. Medline records 
are reviewed manually and MeSH terms are added to eadi with 
software assistance.*-*" Twenty-three human disease category 
headings along with all of tiielr child terms (see the Supporting 
Information. Supplemental Table 1, or visit http://Npseq. 
medJiarvard^u/MedCoie/publicaUon/sjrable Lhtml) were 
selected from the 2002 MeSH Index creating a list of 4033 
human diseases. 

No Index comparable to the McSH Index exists for genes. 
arwJ thus, it was necessary to apply a string search algorltlun 
for gene names or symbols found In Medline icxL A complete 
list of genes, gene names, gene symbols, and frequently used 
synonyms were coUecLed from the LocusLink database at 
NCBl,"-*^ which contains 53 259 Independent records keyed 
by an oflklal gene symbol or name (June 18*^. 2002). For the 
purposes of this study, no distinction was made t>etwcen genes 
and their gene products. Authors often use the same name for 
both, differentiating the two only by the use of italics, if at all. 
For the intended use of tills study, this lack of distinction Is 
unlikely to have a large effect and may In fact be t>eneGciai. 

Initial attempts lo "search the Ulerature using theso lists 
revealed several sotuxes of false positives and false negatives 
(Table 1). False positives primarily arose when the searched 
term had other meanings, whereas Calse negatives arose from 
syntax discrepancies necessitating the development of filters 
to reduce these errors. The syntax Issues were readily handled 
by Including alternate syntax forms in the search terms. The 
false positive cases, caused by duplicative and urvelatcd 
meanings for the terms, were more diflkull to manage. Where 
possible, case sensitive stnr^ mapping reduced inappropriate 
citations. In many cases, however, this was not suflident and 
the terms had to be eliminated entirely, thereby reducing the 
false positive rale but unavoidably undcr-reprcsenUng some 
genes. 

For the purposes of data tracking, a primary gene key was 
selected to represent all s3rnonyms that corrcsporul to each 
gene. Medline rcconis tverc indexed with a primary gene key 
when any synonym for that key was found in the lltic or 
abstract Case-insensitive string mapping was used for all 
searches except as noted above.- No additional weight was 
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Table 1. Systematic Sources of Falsa Positives and False Negalh/es in Unfiltered Data* 



source of error 



type 



Qtercokitlen 



gene symbot/name 
Is not unique 



gene symbol is 

unrelated abbrevlaifon 
gene STmboI/name 

has language meaning 
nonstandard synuuc 
unofHdal gene name/symbol 
zionspedflcd gene name 



false positive 



false posillve 
false posliive 

false negative 
false negative 
false ncgalive 



A£4C-myelin 

associated glycoprotein 
AM(7-inalignancy-assodated 

pioiein 

PA—paiUd bomologue (mouse), 
paDldin (abo abbrev. for Pennsylvania) 

W^Wiskott-Aldrlch Syndrome 
(abo the \vDFd "was") 

£4(7./ Instead offil^CJ 

FiJ instead of 775 J 

estrogen receptor instead of 
Estrogen receptor I 



eliminate this term 



eliminate this term 
case-sensitive stiiog search 

add dash term 

add all gene nicknames 

add iiainUy stem tcnn 



* In pfeBmlfMuy s(tidles» Medline was searched for co-occuncnce of genet and dbeaaes and (he resulting ou^mt was evaluated Co Identify ctror soiicccs thai 
were amenable Co slobal fillen. Each error source Is categorized by the type of emr IC causes: (Use positives are su^giested itlatlenshtps tfaat are not real and 
fabe negBllves ere real rdatlonships thai are undcmpicsenud. Tbe filler solutions used are Indicated. Note Chat In some caso; IhefBlersoliiUon Usdf Inlioduces 
error. In B^neral. error rates maxImUcd sensilMty. even at the expense of spccUtdty If needed. 



added for multiple occurrences of a term or the co-occurrence 
of multiple synonyms for the same gene key. 

Medline records were searched with all qualified g^ne 
identifien. such as the oOlclal/prefcrred gene symbol, the 
oRidal/prefened gene name, all gene nicknames artd all syntax 
varlanu. In situations where there arc several members of a 
gene fairdly or spUce variants, some authors prefer to use a 
shortened gene family name, e.g., estrogen receptor instead of 
ctstrogen receptor 1 (E5RJ), creating a source of false negatives. 
For this reason, gene family stem terms were created for all 
genes Uiat have an alpha or numerical sufDx (e.g.. JL2RA, TCFfi, 
ESRl, etc.) and then used to search the literatuFe. The family 
stem terms were handled separately from the specific gene 
names so tiiat It would be dear when linkages were made lo 
the gene family versus a specific member In that family. 

To improve perfonnance and accuracy, some pre-sclecUon 
%vas applied to the records tiiat were scarmed. Rrel, review 
articles were eliminated to avoid redundant treatment of 
citations. Second* nor>>English journals were removed because 
the natural language itlters were only relevant to English 
publications. Finally. Journals unlikely to contain primary data 
about gcne-dlseasc relationships were also removed (e.g.. Int. 
/ Health Educ., Bedside Nurse, and / HealUi Econ), Togcliicr, 
these fillers reduced the 12 198 221 MedUne publications (]uly 
2002) by 37%. 

Ranldng the Relative Strengllis of Ccnc-Discasc Associa- 
tions. In total, there were 618 708 gene-disease co-cilations. 
in which 16% (8297) of all studied genes had been associated 
to a disease and 96% (3875) of all diseases liad been associated 
to at least one gene. To rank the relative strengths of gene 
disease relationships, we tested several different statistical 
methods and examined the results. With tlie exception of the 
relative risk estimates, the methods provided similar results 
with respect to the rank order of the gene- disease association 
stiengtlis. However, ailer comparing the results to otiicr 
databases and after consulting disease experts, the tog of tiie 
product of frequency (LPF) was selected for further analysis 
because it gave the best results overall. 

Validation of MedCcnc. In developing this tool, it was 
important to minimize the number of missed genes (false 
ncgathres) and miscalled genes (fake positives). However, In 
situations when these goals were In confiicl. indusivencss was 
prioritized. To determine tiie false negative rale in MedCcnc, 
breast cancer was used a Ibst case because It was associated 
with more genes tiian any oilier human disease ajid because 




Figure 1. Estimation of the false negative rate by ccMnparison 
with haiuKurated databases. The breast cancer-related genes 
Idemifled by MedGene were compared with those llsied In 
several other databases including the Tumor Gene Datat>ase 
(TGD).> the Breast Cancer Gene Daiaba5e(BCG).* GeneCards 
(GC)^^ and Swissprol.^* Genes were considered false negatives 
if Ihey were represented in at least one of these other databases 
and not in MedGene and tfieir (ink to breast cancer was sup- 
ported by at least one literature reference. All literature references 
were verified by manual review to confirm their validity. The 
number of genes in each database or shared by more than one 
datatMse is indicated. The false negative rale was calculated by 
genes missed at MedGene (26)/loial number of rK>noverlapping 
genes in other databases (285). 

tiierc were several publk: databases that link genes to breast 
cancer. We compared the list of breast cancer-related genes 
from McdCene to liicse dat;ibases. Ulusirated In Figure 1. 
Among tlie 285 distinct breast conccr-rclalcd genes that were 
supported by at least one literature citation in these hand- 
curatcd databases, 26 were absent from McdCene. suggesting 
a false negative rate of approximately 9%. To dcicmiiiie why 
these were missed, all literature references for ttiesc genes (80 
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papers) were reviewed manually (see the Supporting Informa- 
tSon, Supplemental Table 2, or visit htip;//htpseq.med. 
harvardedu/MedCene/pubUcation/sjrable 2.htin0. Among 
these papers, most false negatives were caused by nonstandard 
gene terms or gene terms eliminated by our spedfidfcy filters. 
Few genes were missed because they were only mentioned in 
review papers (0.4%) or they appeared only In the body of (he 
manuscript but not the abstiacL or tide (1.1%). Of note. 
MedCene Identified approximately 2000 additional breast 
cancer-related genes not Usted In any other database. 

To assess the lalse positive error rate, two comptementaiy 
approaches were used: a detailed analysis of one disease and 
a ^otial examination of 1000 diseases. The detailed approach 
examined the false posldve error rate and Its sources, whereas 
the global approach tested whether the overall results made 
biomedical sense. 

Using the LPF. 1467 genes related to prostate cancer were 
assembled In rank order. We then retrieved approximate^ 300 
Medline records each for the hlgh^t ranked 100 and the lowest 
ranked 200 genes and manually reviewed the titles and 
abstracts to determine the verity of tlie association. Ncarfy 80% 
of the highest ranked 100 genes fell into one of the fh^e 
categories that reflect meaningful gene -disease relationships 
(sec the Supporting Information, Supplemental Table 3. or visit 
http://hlp$eq.med.harvard.edu/MedCene/publlcation/ 
sJTable 3iitml}. Among the lowest ranked 200 genes, ap- 
proximately 70% reflected true relationships. Of the 600 records 
revkswed. there were only two In which the association between 
the gene and the disease was described as rtegadve. Both were 
genes with very low scores. In both cases, the authors did not 
argue the absence of any relationship, but rather that a 
particular feature of the gene or protein was not shown to be 
related to human prostate cancer."*" 

The colnckience of some gene symbols wilh medical ab- 
breviaUons, chemical abbreviations and biological abbrevia> 
tlons resulted in most of the false positives (see the Supporting 
InfomiaUon. SupplemenUl Table 4, or visit http://hipsG- 

q.mcd.harvard.cdii/McdCene/pubUcaUor)/sjrablc 4JiUti1), em- 
pliasizing the importance of the filters that were added in the 
search algoritlun (Tatile 1). Without the niters, the false positive 
rate more than doubled, and the false negative rate rose 
dramatically (data not shown). For cxampk*, among the papers 
about breast cancer, there were only 12 MedUne records that 
referred lo ESRJ and 10 to ESRZ, whereas almost 2000 papers 
mentioned estrogen receptor without specifying £S/?J or ESRZ, 
this latter group was detected by the family stem temi filler. 

To further validate these rcsulis, a g}ol)al analysis of the gene- 
disease relationships dcscrilicd McdCcne was performed. 
For this experiment. It was reasoned that the more ctosely 
related tlic diseases arc to one another, the more they will be 
related to the same gene sets. Thus, if the rclaUonshlps defined 
by MedCene accurately reflected the literature, tlien an unsu- - 
pervised lilcnirchical clustering of the gene claia should group 
diseases In a manner consistent with common mcdk:al think- 
ing. Conversely, If the clustered diseases do not make sense 
biologically or medically. It may reflect excessive false positives, 
false negatives, or Inappropriate scoring of the data. 

To execute this experiment, tlie gene sets and the corre- 
sponding LPF values for 1000 rarKlomly selected diseases (eadi 
with at least 50 gene relationships) were used as a datasct for 
clustering the diseases. A review of the results showed thai the 
resulting discast? dusters were bidced logical based upon 
common medical knowledge (sec the Supporting bifonnation. 
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Suji^lemental Figure 1 , or visit http^/l^ipseq jnediiarvanLedu/ 
MedCene/publlcatlon/s^lgure IJitml). For example. In one 
such duster shown In Flgpre 2. diabetes and its complications 
grouped together and were also dosely linked to diseases 
associated with slarvatloti states. 

The number of genes associated wKh a ^en disease can. 
be estimated by adjusting the MedCene number up by the false 
negative rate (^9%) and down by the false poslthre rale ('^26% 
on average). Using thb. the average disease has 103.7 -k 45.3 
(mean db S4l.) genes associated with It, although the range Is 
quite broad with 2359 genes related to breast cancer. 2122 
genes related to lung cancer and no genes rebled to a number 
of diseases. 

Applying MedGene to the Analysb of Large Datasets. Access 
to a comprehensive summary of the genes linked to human 
diseases provided an opportunlry to analyze data obtained from 
a high-throughput experiment We compared the MedGene 
breast cancer gene Ustlo a gene expression data set generated 
from a micro-array analysis comparing breast cancer and 
normal breast tissue samples. Micro-array analysis Identified 
2286 genes that had greater than a 1-fold dlfTcrcnce In mean 
expression level between breast cancer samples and normal 
breast samples. Using MedCene. we sorted the 2286 genes into 
four classes: 555 genes directly linked to breast cancer In the 
literature by gerw term search (first-degree association by gene 
name): 328 genes directly linked by family term search (first- 
degree assodation by family term): 1021 genes linked to breast 
cancer only through other breast cancer genes (second-degree 
association); and 505 genes not previously associated with 
breast cancer. (Sec the Supporting Information. Supplemental 
Figure 2. or visit hUp://hipseqjned.harvard.edu/MedCene/ 
publlcatlon/sj^lgure 2.html.) Among the 505 previously un- 
related genes, 467 were either newly identified genes or genes 
that had not previously been associated wttli any disease. 
Among the rcmalnbig 38 genes. 9 had been related to other 
cancers, specifically esophageal, colon, uterine, skin, and cervix. 

To determine whether the genes highlighted by the micro- 
array analysis were more likely to have been previously linked 
lo breast cancer in the Uterature, we created a tuv-dimcnsional 
plot of the fold change of cxpiesston level between breast • 
cancer and normal tissue veisus the literature score (LPF) 
(Figure 3A). There was a l>road spread of expression changes 
among the genes dlrccQy linked to breast cancer ranging from 
less than I -fold change (68%) to over 40-fold (0.3%). Notably, 
the majority of genes with greater than 10-fold expression' 
changes were linked to breast cancer by first-d^rcc associa- 
tion. 

Among all 754 genes dbecUy linked to breast cancer In the 
literature, there was no correlation l>etwccn LPF and mlcro- 
anay fold change (r = 0.018. p- value = 0.62). However, when 
we siratlficd the analysis based on the magnliude of the fold 
cliangc. we observed an increasing trend in correlation (Figure 
3B) suggesting that genes with a more substantial change in 
expression level were more likely to have a stronger association 
in Uic liientture. For genes that had IO*fold change or more In 
expression level, tlie corTelatk>n Increased to 0.4 1 (p-value » 
0.0S). 

When we evaluated the micro* array data separately for ER 
positive and ER negative tumors, the trcrKi In correlation 
between fold change and lUcralure score was liighly dependent 
on estrogen receptor status. Ini crest Ingly, there was a similar 
trend In conclation for ER positive tumors, but no trend In 
correlation for ER negative tumors. 
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Figure 2. Global validation by clusiering analysis. 2(A). The gene sets and the corresponding LPF values for 1000 diseases, each with 
ai least 50 gene relaiionships. wer^ used in an unsupenrised clustering ol the diseases based on the gene patterns associated with 
them A sample of the data Is shown here. 2(B). One of the resulting clusters Is shown that corresponds to blood sugar stales. Diabeies 
terms (above the line) and starvation states terms (under the line) clustered together. Within these groups* there Is also clustering of 
diabetic small vessel complications, altered serum chemistries, nutritional disorders. etc,(SuppTemenial Figure 1: hHp://hipseQ.med. 
hdrvard.edu/MedGerte/^bnGatlon/s.Figure 1.hlml). 



Finally, lo vaHdatc our findings, wc computed similar cor- 
relations between the bieasi cancer expression data and 
LPF scores generated by McdCcnc for liypertensSon. a 



disease unrelated to breast cancer. As expected, wc did not 
observe an increasing trend in correlation for hyperten- 
sion. 
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Figure 3. Relaiionship between literature score artd (urtctlonal data ror breast cancer. 3A. The data from an expression analysis of 
samples for breast tunnors and normal breast tissue were analyzed to Indicate the fold difference of expression level between breast 
tumor and ruxmal sample (cutoff > 3.fotd change). The fold changes were plotted against the literature score for the same gene set. 
Green dots represent first-degree association by gene search, blue dots represent first-degree association by faml^ search and red 
dots represent rro-assoclatlon. Some well-studied genes* such as BRCA2 (pink circle), are not reflected by a substantial difference In 
expression level. Furthermore, the majority of genes that have no assodatbn with breast cancer in the literature had less than 10-fold 
expression changes (shaded area). 3B. The Spearman rank-correlation coefficients between literature score (LPF) and the fold change 
of expression level between tumor and normal breast samples (/>8xis) in relation to the amount of fold change of expression level 
(x-axIs). Ger»e rank lists were generated for breast carver (blue) and hypertension (jpink). Correlations were also computed between 
the breast cancer gene LPF scores and fold change expression data among esuogen receptor positive tumors only (light blue) and 
estrogen receptor negative tumors only (purple). 
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Table 2. Top 25 Genes Related to Selected Human Diseases* 
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• ModCcnc results for the ti^ 25 genes usodatMj wHh breast ncoplasins. bypcflendon. rbcumalotd ailbritU. bipolar dbonfcr. and alhttosclcfoalx rcspcctlvdy. 
ranked by LFF scores. The hyperlink lo All the papers co-dUng the gene imd the disease ts available at MedGene website (hltp:/yhipseq.medJiarrard«du/ 
McdCexie/). 

Discussion 

The Human Genome Projccl lieralded a new era in biological 
research where (he emphasis on undenstandlng specific path- 
ways has expanded to global studies of genomic organization 
and biological systems, Hlgh-througl^ul technologies can 
provide novel insight into comprehensive biological function 
• but also Introduces new challenges. The utility of these 
. technologies is limited to the abOlly lo generate, analyze, and 
Jnlerprel lar^e gene lists. MedGcne. a relational database 
derived by mining liw InformaUon In Medline, was created to 
address this need. MedCenc users can query for a rank'Ordcreil 
list of human gene-disease relationships (Table 2) for one or 
more diseases. Each cntiy Is hyperllnked to the original papeis 
supporting each association and to other relevant databases. 

MedCene Is an Innovative extension of previous text mining 
approadics. Percz-IraUcta el al. used the CO annotation and 
their chromosomal locations lo prcdia genes that may con- 
tribute to inherited disorders.* MedCene takes a l>roader view 
and includes all diseases and ail possible gene-dlsoasc letallon- 
shjps. Furthermore. MedCene utilizes co-citallon lo indicate a 
relationship rather Uian CO annotation, which is limited to the 
subset of genes tliat have CO anrratation. Our approach is 
complementary to that taken by Chaussabd and Sher. wl>o 
used the frequency of co-citcd terms lo duster gcrtcs into a 
liicrarchy of gene-gene rcLitJonships.* 

A unique aspect of this tool is the ability u> assc^ the relative 
strengths of gcnc'disGasc relationships based on Ihe frequency 
of boUi co-citaUon and single citation. This presupposes that 
most co-citations describe a positive association, often rcrcrrcd 
to as publication bias'* and is supported by our observations 
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Uiat negative associations are rare (Supplemenul Table 3: 
http://iilpseqjncd.liarvard.edu/MedCcne/publlcalion/s_Ta- 
bte 3Jitm9. Of course, relationships estabUdied by frequency 
of co-diatlon do not necessarily represent a true biologic^ link: 
however, it is strong evidence to support a true relationship. 

Another Important feature of MedCene is the implementa- 
tion of sofhvare fiiteis that subs tan tiaUy reduced the error rate. 
We estimate tl^t less than 10% of all assodations were missed 
and at least 70% of even the wealcest associations were real. 
For this study, ail of the fillers that we applied were general 
ones. eg., expanding the list of ail gene names to address the 
dUferem syntax fonns used by different joumab. eliminating 
gene names that correspond to common English words, etc. 
The majority of ttie remaining search term ambiguities were 
IdiosyrKralic and difficult to Identify ^stcmatlcally without 
causbig a slgniflcant rise in false negatives. Alternative ap- 
proaches, such as the examination of the nearest neighbor 
tcnns. need to be considered to further reduce the false positive 
rate. 

It is not uncommon to sec expression changes in micro- 
array experiments assmall as 2-fold reported in the literature. 
Even when these e3q)ression changes are statistically significant, 
it is not always dear if they are blologicaUy meaningful. When 
comparing expression leveb of disease to normal tissue, one 
expects an enrichment of kru>wn disease-related genes to 
appear in the altered expression gr6up. MedCene provided a 
unlf]ue opportunity to test this notion in the context of csxlstlng 
knowledge on a novel breast cancer micro-array dalasct. For 
genes displaying a 5-fold change or less in tumors comf)ared 
to normal, there was no evidericc of a correlation between 
altered gene expression and a known role in the disease. This 
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Table 3. Genes with Large Expression Changes in ER- tM 

Not In ER+ Breast Tumors 
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Table 3. McUCenc lifcutiDcd a set of rebUvdy ttndcrshtdled, >t( highly 
expressed genes In ER ncgaUve. bm not ER posiUve breast tumors. All of 
tiicsc genes haw cHhcr never been co-dtcd with breast cancer or have a ' 
weak assocLilion except those matkcd with nn '. 



reflects the many gcniis whose role In breast cancer may not 
involve large changes !n expression In sporadic tumors (e.g., 
DKCAl and BRCA^ and genes whose modest changes In 
expression may be unrelated to the disease. StnJdngly, among 
genes with a 10-fold change or more in expression level, there 
was a sUong and signlOcant corretaUon between expression 
level and a published role in the disease, providing the first 
^obal validation of the micro-array approach to Identifying 
diseasc-spedfic genes. 

Tlie results derived from MedCcne have two Implications. 
First, a careful hunt for corroborating evidence of a role m 
breast cancer sliould precede any further study of genes with 
less than 5- fold expression level changes. Second, any genes 
wlih lO'fold changes or more are likely to be related to breast 
cancer and warrant attenUon. It is likely that this thr^hold will 
change depending on the disease as well as tlic experiment 

Interestingly, the observed corrdalion was only (bund among 
ER-posiiive tumors, not ER-negatlvc. This may reOect a bias 
in the literature to study the more prevalent type of tumor in 
tlic population. Furthermore, this emphasizes that caution 
must be taken when inierpreling experiments that may contain 
subp>opulaaons that behave very differently. Tlic MedCenc 
approach identlDcd a set of relatively understudied, yet highly 
expressed genes bi ER-ncgative tumors tiial arc wortliy of 
furtJwr examination (Table 3). 



In coru^lon, we have developed an automated method of 
stmunarlzing asid oig^urilzlng the vast biomedical Uteratur^ 
our knowledge the resulting database b the most conipreheiv 
shre and accurate of Its Und. By gmratix^ ascoie that reflects 
the strength of the association, il piovides an linportanl toot 
for the rapid and flexible analysis of large datasets from various 
high-throughput screening experiments. Faitheraiore. it can 
be used for selecting subsets of genes for lunctionai studies, 
for bunding disease-specific arrays, for looking at genes com- 
mon to multiple diseases and vattotis other hlgh-throu^put 
applications. Jn the future. It ^11 be possible to enhance the 
utflUy of the MedCene database by building links between 
genes and other MeSH terms as well as other biological 
processes and concepts, such as cell dhislon and responses to 
small molecules. 
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ABSTRACT The consistent cytogcnclic translocation of 
chronicmydogenous leukemia (the Philadelphia chromosome, 
Ph ) has been observed In cells of multiple hematopoietic 
lineages. This translocation creates a chimeric gene composed 
of breakpoiht-dtister-regton {bcr) sequences from chromosome 
21 fused to a portion of the abt oncogene on chromosome 9. The 
resultfaig gene product (P210*^") resembles the transformtog 
protein of the Abelson murine leukemia virus in Its structure 
and tyrosine kinase activity. PlIO' *" Is expressed In Ph'- 
posUive ceil lines of myeloid lineage and In clinkal specimens 
witli myeloid predominance. We show here that Epstefai-Barr 
^ims-transformed B-lymphocyte lines that rclafai Ph* can 
express PllO'"". The level of expression In these l)<cll lines Is 
generally lower and more variable than that observed for 
myeloid Ihies. Protebi expression Is hot related to amplification 
of the abl gene but to variation bi the level of bcr-abl mRNA 
produced from a single Ph' template. 



Chronic myelogenous leukemia (CML) is a disease of the 
pluripotent stem ceU (1). In greater than 95% of patients, the 
leukemic cells contain the cytogenetic marker known as the 
Philadelphia chromosome, or Ph* (2). This reciprocal 
translocation event between the long arms of chromosomes 
9 and 22 has been used as a disease-specific marker for 
diagnosis and evaluation of therapy. Multiple hematopoietic 
lineages, including myeloid and B-lymphoid, contain Ph^ in 
early or chronic phase, as well as in che more acute accel- 
erated and blast crisis phases of the disease. 

One molecular consequence of Ph' is the iranslocation of 
the chromosomal arm containing the c-abl gene on chromo- 
some 9 into the middle of the breakpoint-cluster region {bcr) 
gene on chromosome 22 (3-6). Although ihc precise 
translocation breakpoints arc variable, an RNA-splicing 
mechanism generates a very similar 8-kilobasc (kb) mRNA in 
each case (5-9). The hybrid bcr-abl message encodes a 
structurally altered form of the abl oncogene product, called 
P2{Qc abi (10-13), with an amino-terminal segment derived 
from a portion of the exons of bcr on chromosome 22 and a 
carboxyl-terminal segment derived from a major portion of 
the exons of the c-abl gene on chromosome 9. The chimeric 
structure of bcr-abl^nd the resulting P210*' "^' is similar to the 
structure of the Abelson murine leukemia virus gag-abl 
genome and resuhing P160^«^' transforming gene product. 
Both proteins have very similar tyrosine kinase activities (10, 
II, 14) which can be distinguished by their relative stability 
to denaturing detergents and by their ATP requirements from 
the recently described tyrosine kinase activity of the c-abl 
gene product (15). 
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In concert with structural modification of the amino- 
terminal ponion of the abl gene, increased level of expression 
has been implicated in activation of c-abl oncogenic poten- 
tial; Myeloid and erythroid cell lines and clinical samples 
derived from acute-phase CML patients contain about 10- 
fold higher leycls of the 8-kb bcr-abl mRNA and P210^*^ than 
the c-abl mRNA forms (6 and 7 kb) and P145****!* gene product 
(5,8,9, 11). The higher level of expression of the chimeric 
bcr-abl message in acute-phase cells is not likely lo be solely 
due lo the presence of the bcr promoter sequences at the 5' 
end of the gene, since the normal 4.5-kb and 6.7-kb bcr- 
encoded mRNA species are expressed at an even lower level 
than the normal c-abl messages (5, 6). 

We have analyzed a series. of Epstein-Barr virus-ininor- 
lalized B-lymphoid cell lines derived from CML patients ;16). 
With such in vitro clonal cell lines, we can evaluate whether 
the presence of Ph* always resuhs in synthesis of the chimeric 
fccr-flM message and protein, and whether the quantitative 
expression varies for cells of B-lymphoid lineage as com- 
pared to previously examined myeloid cell lines. Our results 
show that ceU lines that retain Ph^ do express bcr-abi message 
and protein, but that the level is generally lower and more 
variable than previously seen for myeloid cell lines. The 
demonstration that the Ph^ chromosomal template can vary 
in its level of expression of P210^-"" suggests that secondary 
mechanisms, beyond the translocation itself, contribute to 
the regulation of the bcr-qbl gene in different cell types or 
subclones that derive from the affected stem cell. 

MATERIALS AND METHODS, 

Cells and Cell Labclings. Epslcin-Barr virus-Uansformed 
B-lymphoid cell lines were established from peripheral blood 
samples of chronic- and acute-phase CML patients as report- 
ed (16). The cell lines are designated according lo patient 
number, karyotype, and lineage. For example. SK- 
CML7Bt(9,22)-33 refers to CML patient 7, B-lymphoid cell 
line. 9;22 iranslocation (Ph^), cell line 33; and SK-CML7BN- 
2 refers to B-cell line 2 with a normal karyotype derived from 
the same patient. Repeat karyotype analysis was performed 
to verify the retention of Ph* just prior to analysis tor ab! 
protein and RNA. Cells were maintained in RPMI 1640 
medium with 20% fetal bovine serum. We have not observed 
any consistent pattern oi in vitro growth rate that correlates 
to the stage of disease at the time of transformation with 
Epstein-Ban virus. Cells (1.5 x 10') were washed twice with 
Dulbecco's modified Eagle s medium lacking phosphate and 



Abbreviations: frrr. breakpoint-cluster region; CML. chronic 
myelogenous leukemia; kb. kilobasc(s}. 
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supplemented with 5% dialyzed feial bovine senim. Cells 
were then resuspended in 2 ml of the minimal medium. 
Labeling was started with the addition of |"PJorthophoS' 
phate (1 raCi/ml; ICN; 1 Ci = 37 OBq) and continued at 37^ 
for 3-4 hr. 

Imnninopredpltation and ImniDnobtotlfaig. Imnuinoprecip* 
itations were carried out as described (10). Cells (1.5 x }0^ 
were washed with phosphate-buffered saline and extracted 
with 3-5 ml of phosphate lysis buffer (1% Triton X-100/0.1 
NaDodSO4/0.5% deoxycholatc/10 mM Na2HP04, pH 7.5/ 
100 mM NaCI) with 5 mM EDTA and 5 mM phenylrocthyl- 
sulfonyl fluoride. Extracts were clarified by ccntiifugation 
and precipitated with normal or rabbit anti-abi sera (anti- 
pEX-2 or anli-pEX-5) (17). The precipitated proteins were 
elcctrophoresed in a NaDodS04/8% polyacrylamide gel. 
^'P-labeled proteins were detected by autoradiography. 
Alternatively, abl proteins were detected by immunoblotting. 
Extracts from unlabeled cells were clarified, and proteins 
were concentrated by inununoprecipitation with rabbit anti- 
sera against oM-encoded proteins (anti-pEX-2 and anti-pEX- 
5 combined (17)) and then fracttonaled in 8% acr>'lamide gets. 
The proteins were Iransferred from the gel to nitrocellulose 
fillers, using protease-facilitaied transfer (18). The abl- 
encoded proteins were detected using murine monoclonal 
antibodies as a probe and pcroxidase-conjugatcd goat anii- 
mouse second stage antibody (Bio-Rad) for development. 
Rabbit anttsera and mouse monoclonal antibodies to abl 
proteins were prepared using bactcrially expressed regions of 
the y abl protein as immunogens (17. 19). Ami-pEX-2 anti- 
bodies react with the internal lyrosinc kinase domain and 
anli-pEX-5 antibodies react with the carboxyl-terminal seg- 
ment of the abl proteins. 

RNA Analysis. RNA was extracted from 10" cells by the 
NaDodSO^/urea/phenol method (20). Polyadenylylaied 
RNA was purified by oligo(dT) affinity chromatography. 
Samples were electrophoresed in a 1% agarose/formalde- 
hyde gel and transferred to nitrocellulose, abl RNA species 
were detected by hybridization with a nick-translated ^-abl 
fragment probe (21). 

DNA Analysis. DNA was prepared from 5 x 10' cells of 
each cell line and processed for Southern blots with a \-abl 
probe as described (21). 

RESUI.TS 

Variable levels of P210^"*»' Are Delected in Ph*-Positive Cell 
Lines. Ph^-positive and Ph^-negaiive. Epsicin-Barr virus- 
transformed B-lymphocytc ccU lines derived from the same 
patient were examined for VIW"^ synlhesis by immuno- 
precipitation of ["PJorthophosphale-labeicd cell extracts 
with anti-abl sera (Fig. 1). The normaLc ofc/ protein P145*-'*^ 
was detected at a similar level in multiple Ph^-positive and 
Ph*-negalive cell lines. P2W***>' was only delected in the 
Ph*-positive cell lines because the bcr-abl chimeric gene 
which encodes V2\{3F'^^^ resides on the Ph» (4, 5, 11. 13). The 
level of P210^ *" was about 4- to 5-fold higher than the level 
of P145C-.W in ti^g SK-CML7Bt-33 cell line (Pig. L4, +). The 
Ph^-positivc crythroid-progenitor cell line K562 (C) showed 
a level of P210*=»*»' about lO fold higher ihan P145'^'»»>*. 
However, the level of P210^ **»' w as aboui onc-fifih that of 
P145*= »»''inlhePh'-posiiiveSK-CML16Bt-l cell line (Fig. \B, 
+). Comparison of different autoradiographic exposures 
roughly indicated that the level of P210'' ''" varies over a 
20-fold range between these Ph'-positive B-ceU lines. Anal- 
ysis of four additional Ph*-posiiive B-cell lines dcmonstraicd 
that the level of P210' '" fell into two general classes- some 
cell lines had a level of P210^-*^' similar to SK CML7Bt-33 
and others had the low level simihu^ lo SK-CMLI6B1-I (Table 
1). This differs from previous studies with !*h'-positive 
myeloid cell lines and patient sample!^ derived from acute- 
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Fig, 1. Detection of variable levels of P210^»*^ in Ph'-positive 
B-ccIl lines. Production of P145*^" and P210'-" in Epstcin-Barr 
virusttransformed B-ccll lines derived from a blast>crisis (>!) and a 
chronic-phase (fi) CML patient was examined by metabolic lat>eling 
with ["Plorthophosphate and immunoprecipitation. Ph*»ncgaiivc 
(-) and Ph^-positlve (+) cell lines derived from each paticni were 
analyzed. The Ph*-negative cell line in A Is SK-CML7BN-2 and in 
B," is SK-CML16BN-1. The Ph»-posiiivc cell line in .4,+ is 
SK-CML7Bt-33 and in /Jf.+ is SK-CMLl6Bt:i. The K562 cell line, a 
Pb*-positive erythroid progenitor cell line spontaneously derived 
from a blast-crisis patient (33)» is represented in C. Cells (1.5 x 10') 
were metabolically labeled with 2 mCi of t^P)orthophosphate for 3-4 
hr and then were exuacted and clarifled by ccntrifugation. Samples 
were immunoprecipitated with control normal serum (lanes 1), 
anii-pEX-2 (lanes 2). or anti-pEX-3 (lanes 3) and analyzed by 
NaDodS04/8% PAGE followed by autoradiography with an inten- 
sifying screen (3 days for A and C. 10 days for B). 

phase CML patients, in which P210^'**'* was delected at a 
lO-fold higher level than P145'-"" (refs. 10 and 11; Table 1). 
There was no large difference in level of chimeric mRNA and 
P2iQe-abi expressed in four myeloid/erythroid-lineagc Ph^- 
positivc cell lines (K562. EM2. EM3. CML22, and BV173; 
refs, 9 and 11), despite a 4- to 5-fold amplification of 
aM-related sequences in the K562 cell line. 

Detection of different levels of P210''-»^* in Fig. 1 could be 
due to decreased phosphorylation of P210*^-^*, a lower level 
of P21(r-*^' synthesis, or altered stability of the protein. To 
help distinguish among these possibilities, the steady-state 
level of P210'^ "^' in the cell lines was assayed by immuno- 
blotting. The results show that SK-CML7Bit33 (Fig. 2/t, +) 
had a higher level of P210' '^' than P145. similar to the results 
with metabolic labeling (Fig. 1). We did not detect P310'^"*^ 
by immunoblotting with 2 x IQ' cells of line SK-CML8BI-3 
(Fig. 2B, +). Reconstruction experiments using dilut ons of 
cell extracts showed that we could detect about 5-109f.- the 
level of P210^ expressed in the K562 cell line (data not 
shown). We infer that the steady-state level of P210* "" in 
SK-CML8Bt-3 is lower than the level in SK-CML7Bi-33 by 
a factor of at least 10. The level of P210^ **" detected in these 
assays correlated with the amount of P210^ *" tyrosine kinase 
activity that could be detected in vino (data not shown). 

Different Levels of P210' '" Arc Reflected in the Amouni of 
Stable bcr-abl mRNA. To identify the basis for detection of 
variable levels of r210''"^\ we examined the produciion of 
the abt RNA. RNA blot hybridiiation analysis using ii \ -ohl 
probe (Fig. 3) showed thai the normal 6- and 7>kb <:-abl 
mRNAs were present at a similar level in Ph^-posiii/e and 
-negative cell lines derived from different patients. HoAcvcr, 
the 8-kb mRNA that encodes P210=-**' was detected at a 
10-foId higher level in SK CML7Bt.33 (Fig. 3A, +) than in 
SK-CML16Bt-l (B, +). which correlated with the relative 
level of P210'^='^' delected in each cell line. Analysis of 
additional cell lines demonstrated that the level of S kb RNA 

dirccily correlated with ihe level of P210""'^ (Table 1). The 
variation in level of 8-kb RNA detected in these cell lines was 
not due 10 loss or gain of Ph'. because cytogenetic aiialysis 
confirmed the presence of Ph* in these cell lines (rcf. 16 and 
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Table L Relative levels of bcr-abl expression in Epsteio-Barr. 
virus-immortalized B-ccU Une s and myeloid CML lines 
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*CeU lines derived from CML patients by transformation with 
EpsteiiwBarT vims as described (16). Names of cell tines indicate 
patiem number and Ph> status: SK<:ML7Bt indicates a cell line 
<]erived from patient 7 that canks the 9;22 Ph* translocation; N 
indicates a normal karyotype. Myeloid-erythroid cell lines (K562, 
EM2. and BV173) are described in previous publications (9. 11. 22, 
33). 

^Status of patient at the time cell Une was derived. BC. blast crisis; 
Acc, accelerated phase. 

♦presence ) or absence (-) of Ph' as demonstrated by karyoiypic 
or Southern bloi analysis. 

iV2](f^^ detected as described in legend to Fig. 1. B-ccU lines 
derived from blast-crisis and acccleratcd-phasc patients had levels 
of P210 3' to 5-foId higher (+ + + ) than levels of P145. Chronic- 
phase-derived cell lines had P210 levels lower than or just cquivalcni 
(+) to the level of ?U5. Myeloid and erylhroid lines had levels of 
P210 5- to 10-fold higher than P145 (+ + + + +). 

^Eighl-kilobase bcr-abl mRNA delected as described in legend to 
Fig. 2, Symbols: ±, borderline dcieciablc; + + + + + , level of 8-kb 
mRNA 5- to 10-foId higher than that of the 6- and 7-kb c-afr/mRNA 
species; + + + . level of 8-kb mRNA 3- to 5- fold higher than that of 
the 6- and 7-kb species; + , a level approximately equivalent to that 
of the 6- and 7-kb messages. 

data not shown). There was no difference in the copy number 
of flW-relatcd sequences as judged by Southern blot analysis 
(Fig. 4). Only the K562 cell line control showed an amplifi- 
cation of ab\ sequences, as previously reported (22» 23). 
These combined data suggest that differential 6cr-aM mRNA 
expression from a single gene template is responsible for the 
variable levels of P210=-*" delected. This could be mediated 
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Fig. 2. Analysis of steady-state abl protein levels by immuno- 
blotting. Cell extracts prepared from 2 x 10^ cells of lines SK 
CML7BN-2 M,-). SK.CML7Bt-33 U. + ). SK-CMLBBN-IO (fi,-). 
and SK-CML8Bt-3 05,+) were concentrated by tmmunoprecip- 
ilation with anii-pEX-2 plus anti-pEX-5. Samples were then electro- 
phorcscd in a NaDodS04/8% polyacrylamidc gel and transferred to 
nitrocellulose, using protease-facilitiiled transfer (18). abl proteins 
were delected using a mixture of lu-o monoclonal antibodies directed 
against the pEX-2 and pEX-5 oW-proicin fragments produced in 
bacteria (19) as a probe and a peroxidase-conjupaied poat anti-mouse 
second-stage antibody (Bio-Rad) for development. 
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Flo. 3. Comparison of abl RNA levels in Fh>-posittve and 
•negative B-ceU Unes. The levels of the normal 6- and 7-kb c-abl 
RN As and the 8-kb bcr-abi RNA were analyzed by Mot hybridization 
using a v-a6/ pn>be. RNA was extracted from Ph'-negative Unes 
SK-CML7BN-2 04.-) and SK-CML16BN-1 (B.-). from Ph*-pos- 
Itive lines SK.CML6Bi-33 and SK<:ML16Bt-3 and 
from Une K562 (C,+) by the NaDodSO,/urea/phenol method (20). 
Poiyadenylylaied RNA was purified by oUgo(dT) affinity chroma- 
tography, and 15 ^g of each sample was etectrophoresed in n 1% 
agarosc/formaldchydc gel and then transferred to nitrocellulose The 
blotted RNAs were hybridized with a nick-translated v-«W fragment 
probe (21) and (hen auioradiographed for 4 days. 



by factors inlluencing the transcription rate of the bcr-abl 
gene or the stability of the mRNA. 

DISCUSSION 

Several Hncs of evidence suggest that formation of Ph^ is not 
the primary event that affects the stem cell in CML. Patients 
have been identified that present with the clinical picture of 
CML but only later develop Ph' (1). This observation, 
coupled with studies of G6PD (glucose-6-phosphaic dehy- 
drogenase)-heiero2ygous females with CML that demon- 
strate stem-cell clonality by isozyme analysis among cell 
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Fig. 4. Southern blot analysis of abl sequences in Ph*-posiiive 
and -negative B-cell lines. High molecular weight DNA (15 Mg) ^ as 
digested with restriction endonuclcasc BamWl, separated in a 0.8% 
agarose gel, and then transferred to nitrocellulose. The blotted DNA 
fragments were hybridized ^^ith a nick-translated, lA\bBgl W \' abl 
fragment (1.5 x lO" cpm/Mg; ref. 21) and exposed for 4 days (A) 
Auloradiogram of nW-specific fragments in cell lines HL-GO(:anc 1). 
EMZOane 2). K562 (lane 3). SK-CML7Bi-33 (lane 4). SK-CMLRBt-3 
(lane -5). SK-CML16Bt-l (lane 6). SK.CML21BI-6 (lane ). SK 
CML35Bt-2 (lane 8). SK-CML7BN-2 (lane 9). SK-CML8BN 2 -lane 
10). and SK-CML35DN-1 (lane 1 1). (B) Ethidium bromide sia ning of 
agarose gel prior to transfer to nitrocellulose, showing the ievr.i of 
variation in amount of DNA lo;tded per lane. 
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populations that lack the Ph' marker, supports a secondary 
or complementary role for Ph* in the progression of the 
disease (24, 25). This chromosome marker is found in 
chronic, accelerated, and Mast-crisis phases of the disease. It 
is likely that Ph* confers some growth advantage, since cells 
with the marker diromosome eventually predominate the 
marrow and peripheral blood even in chronic phase. During 
the phase of blast crisis, many patterns develop additional 
chromosome abnormalities, including duplicauon of Ph\ a 
variety of trisomies, and complex translocations (26). This 
is suggestive evidence for Ph^ being a necessary but not 
sufficient genetic change for the full evolution of the 
disease. 

The realization that one molecular result of Ph* is the 
generation of a chimeric bcr-abl protein with functional 
characteristics aind structure analogous to the gag-abl trans- 
forming protein of the Abelson murine leukemia virus 
strengthens the argument for an important role of Ph' in the 
pathogenesis of CML. Although the Abelson vinis is gener- 
ally considered a rapidly transforming retrovirus, its effects 
can range from overcoming growth factor requirements, lo 
cellular lethality, to induction of highly oncogenic tumors in 
a number of hematopoietic cell lineages (27. 28). Even in the 
transformation of murine cell targets, there are several lines 
of evidence that suggest that the growth-promoting activity of 
. the v-aM gene. product is complemented by further cellular 
changes in the production of the malignani-ccll phenol ypc 
(29-31). - F 

The regulation of bcr-abl gene expression is complex 
because the 5' end of the gene is derived from the non-abt 
sequences, bcr, normally found on chromosome 22 (6). The 
level of stable message for the normal bcr gene and the 
normal ab! gene are both much lower than the level of the 
bcr-abl message and protein from cell lines arid clinical 
specimens derived from myeloid blast-crisis patients (5, 6. 
11). Therefore, the high level of bcr-abt expression cannot 
simply be attributed to the regulatory sequences associated 
with bcr. Possibly, creation of the chimeric gene disrupts the 
normal regulatory sequences and results in a higher level of 
expression. Variation in bcr-abi expression may result from 
secondary changes in the strrjcture of the chimeric gene or 
function of /r<?«j-acting factors that occur during evolution of 
the disease. Our analysis of P210= »w and the S-kb mRNA in 
Epstein-Barr virus-transformed Ph'-positive B-cell lines 
demonstrates that stable message and protein levels from (he 
bcr-abl gene can vary over a wide range. This variation does 
not result from a change in the number of bcr-abl templates 
secondary to gene amplification but more likely from changes 
in either transcription rate or mRNA stability. We suspect 
this range of bcr-abl expression is not limited to lymphoid 
cells. Analysis of peripheral blood leukocytes derived from 
an unusual CML patient who has been in chronic phase with 
myeloid predominance for 16 years showed a level of 
''^r^^^^ ^^^^ °f P145-»^'. as detected by metabolic 
labelmg with r^PJorthophosphaic and immunoprccipitaiion 
(S.C., O.N.W.. and P. Greenberg. unpublished observa- 
tions). Lower levels of expression of the chimeric mRN A 
have been demonstrated in clinical samples from chronic- 
phase CML patients compared to acuic-phase CML patients 
(9). Others have reported chronic-phase paiienis wiih vari 
able but, in some cases, relatively high levels of ihc bcr-abl 
mRNA (32). The sampling variation and the heierogenous 
mixture of cell types in clinical samples complicate such 
analyses. Further work is needed to evaluate whether there 
IS a deOned change in P210^-^' expression during the pro- 
gression of CML. Ii is interesting to note chat among the 
hmilcd sample of Ph^-positive B-cell lines we have examined 
(Table I), we have seen higher levels of P210^^^' in (hose 
derived from paiiv - at more advanced stages of the disease. 
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It will be important to search for cell-type-specific mecha- 
nisms that might regulate expression of bcr-abl from Ph^. 
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1 Introduction 

A proteome has been defined as the protein complement 
expressed by the genome of an organism»or, in multicel- 
lular organisms, as the protein complement expressed by a 
tissue or differentiated cell [l|. In the most common im- 
plemenlation of proteome analysis the proteins extracted 
from the cell or tissue analyzed are separated by high 



Proteome analysis: Biological assay or data archive? 

In this review we examine the current state of proteome analysis. Tliere are 
three main issues discussed: why it is necessary to study proleomes; how pro- 
teomes can be analyzed with current technology; and how proteome analysis 
can be used to enhance biological research. We conclude that proteome anal- 
ysis is an essential tool in the understanding of regulated biological systems. 
Current technology, while still mostly limited to the more abundant proteins, 
enables the use of proteome analysis both to establish databases of proteins 
present, and to perform biological assays involving measuren^ent of multiple 
variables We believe that the utility of proteome analysis in future biologica 
research will continue to be enhanced by further improvements m analytical 
technology. 

resolution two-dimensional gel electrophoresis (2-DE), 
detected in the gel and identified by their ammo acid 
sequence. The ease, sensitivity and speed with which gel- 
separated proteins can be identified by the use of recently 
developed mass spectrometric techniques have drannati- 
cally increased the interest in proteome technology. One 
of the most attractive features of such analyses is that conri- 
plex biological systems can potentially be studied in their 
entirety, rather than as a multitude of individual compo- 
nents. This makes it far easier to uncover the many com- 
plex, and often obscure, relationships between mature 
gene products in cells. Large-scale proteome characteriza- 
tion projects have been undertaken for a number of dif- 
ferent organisms and cell types. Microbial proteome pro- 
jects currently in progress include, for example: Saccharo- 
myces cerevisiae 12). Salmonella enterica [31, Spiroptasma 
melliferum (4), Mycobacterium tuberculosis 15), Ochrobac- 
trum anthropi 16|, Haemophilus influenzae \ll Synecho- 
cystis spp. (81, Escherichia colt [9], Rhizobium legumino- 
sarum llO), and Dictyostetium discoideum 111). Proteome 
projects underway for tissues, of more complex organ- 
isms include those for: human bladder squamous cell 
carcinomas (12), human liver (13), human plasma (13), 
human keratinocytes 112). human fibroblasts (12], mouse 
kidney (12), and rat serum 114). In this manuscript we cri- 
tically assess the concept of proteome analysis and the 
technical feasibility of establishing complete proteome 
maps, and discuss ways in which proteome analysis and 
biological research intersect. 
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2 Rationale for proteome analysis 

The dramatic growth in both the number of genome 
projects and the speed with which genome sequences 
are being determined has generated huge amounts of 
sequence information, for some species even complete 
genomic sequences (115-17]). llie description of the 
state of a biological system by the quantitative measure- 
ment of system components has long been a primary 
objective m molecular biology. With recent technical 
advances including the development of differential dis- 
play-PCR 1 18), cDNA microarray and DNA chip techno- 
logy [19. 201 and serial analysis of gene expression 
(SAGE) 12 U 22], it is now feasible to establish global and 
qu'aniiiative mRNA expression maps of cells and tissues, 
in which the sequence o( all the genes is known, at a 
speed and sensitivity which is not matched by current 
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prolein analysis technology. Given the long-standing 
paradigm in biology that DNA synthesizes RNA which 
synthesizes protein, and the ability to rapidly establish 
global, quantitative mRNA expression maps, the ques- 
tions which arise are why technically complex proleome 
projects should be undertaken and what specific types of 
information could be expected from proteome projects 
Nvhich cannot be obtained from genomic and transcript 
profiling projects. We see three main reasons for pro- 
teome analysis to become an essential component in the 
comprehensive analysis of biological systems, (i) Protein 
expression levels are not predictable from the mRNA 
expression levels, (ii) proteins are dynamically modified 
and processed in ways which are not necessarily 
apparent from the gene sequence^ and (iii) proteomes 
are dynamic and reflect the state of a biological system. 

2.1 Correlation between mRNA and protein expression 
levels 

Interpretations of quantitative mRNA expression profiles 
frequently implicitly or explicitly assume that for specific 
genes the transcript levels are indicative of the levels of 
protein expression. As part of an ongoing study in our 
laboratory, we have determined the correlation of expres- 
sion at the mRNA and protein levels for a population of 
selected genes in the yeast Saccharomyces cerevisiae 
growing at mid-log phase (S. P. Gygi et a/., submitted for 
publication). mRNA expression levels were calculated 
from published SAGE frequency tables 122]. Protein 
expression levels were quantified by metabolic radioia- 
beting of the yeast proteins, liquid scintillation counting 
of the protein spots separated by high resolution 2-DE 
and mass spectromelric identification of the protein(s) 
migrating to each spot. The selected 80 samples consti- 
tute a relatively homogeneous group with respect to pre- 
dicted haif-life and expression level of the prolein pro- 
ducts. Thus far, we have found a general trend but no 
strong correlation between protein and transcript levels 
(Fig. I). For some genes studied equivalent mRNA trans- 
cript levels translated into protein abundances which 
varied by more than 50-foId. Similarly, equivalent steady- 
slate prolein expression levels were maintained by trans- 
cript levels varying by as much as 40-fold (S. P. Gygi 
et ai^ submitted). These results suggests that even for a 
population of genes predicted to be relatively homoge- 
neous with respect to protein half-life and gene expres- 
sion, the protein levels cannot be accurately predicted 
from the level of the corresponding mRNA transcript. 

2.2 Proteins are dynamically modified and processed 

In the mature, biologically active form many proteins arc 
post-translalionally modified by glycosylation, phosphor- 
ylaiion» prenylalion, acylatton, ubiquitination or one or 
more of many other modincations |23| and many pro- 
teins are only functional if specifically associated or com- 
plexed Willi other molecules, including DNA, RNA, pro- 
teins and organic and inorganic cefaclors. Frequently, 
modifications are dynamic and reversible and may alter 
the precise three-dimensional structure and the slate ol" 
riciiviiy ol a prolein. Collectively, the state of modifica- 
tion i.>f the proteins which constitute a biological system 
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Figure I. Correlation between mRNA and prolein levels in yeast cells. 
For a selected population of 80 genes, protein levels were measured 
by '^-S-radiolabeling and mRNA levels were calculated Tronn publi- 
shed SAGE tables. Inset: expanded view of the low abundance region. 
For more experimental details, also sec Figs. 5 and 6, (S. P. Gygi et at., 
submitted). 



are important indicators for the stale of the system. The 
type of protein modification and the sites modified at a 
specific cellular state can usually not be determined 
from the gene sequence alone. 

2.3 Proteomes are dynamic and reflect the state of a 
biological system 

A single genome can give rise to many qualitatively and 
quantitatively different proteomes. Specific stages of the 
cell cycle and stales of differentiation, responses to 
growth and nutrient conditions, temperature and stress, 
and pathological conditions represent cellular states 
v^hich are characterized by signiftcantly different pro- 
teomes. The proteome, in principle, also reflects events 
that are under translalional and post-lranslational con- 
trol. It is therefore expected that proteomics will be able 
to provide the most precise and detailed molecular des- 
cription of the state of a cell or tissue, provided that the 
external conditions defming the slate are carefully deter- 
mined. In answer to the question of whether the study 
of proteomes is necessary for the analysis of biomolec- 
. ular systems, it is evident that the analysis of mature pro- 
tein products in cells is essential as there are numerous 
levels of control of prolein synthesis, degradation, 
processing and modification, which are only apparent by 
direct prolein analysis. 



3 Description and assessment of current proteome 
analysis technology 

3.1 lechnical requirements of proteome technology 

In biological systems the level of expression as well as 
the states of modification, processing and macro-molec- 
ular association of proteins are controlled and modu- 
lated depending on the stale of the system. Comprehen- 
sive analysis of the identity, quantity and stale of modifi- 
cation of proteins therefore requires the detection and 



i» S'j/O 



1864* p. A. Haynes et at 




• 

quanlitation of the proteins which constitute the system, 
and analysis of differentially processed forms. There are 
a number of inherent difficulties in protein analysis 
which complicate these tasks. First, proteins cannot be 
amplified. It is possible to produce large amounts of a 
particular protein by over-expression in specific cell sys- 
tems. However, since many proteins are dynamically 
post-lranslationally modified, they cannot be easily am- 
plified in the form in which they finally function in the 
biological system. It is frequently difficult to purify from 
the native source sufficient amounts of a protein for 
analysis. From a technological point of view this trans- 
lates into the need for high sensitivity analytical tech- 
niques. Second, many proteins are modified and pro- 
cessed post-translatipnally. Therefore, in addition to the 
protein identity, the structural basis for differentially 
modified isoforms also needs to be determined. The dis- 
tribution of a constant amount of protein over several 
differentially modified isoforms further reduces the 
amount of each species available for analysis. The com> 
plexity and dynamics of post translalional protein edit- 
ing thus significantly complicates proteome studies. 
Third, proteins vary dramatically with respect to their 
solubility in commonly used solvents. There are few, if 
any, solvent conditions in which all proteins are soluble 
and which are also compatible with protein analysis. This 
makes the development of protein purification methods 
particularly difficult since both protein purification and 
solubility have to be achieved under the same condi- 
tions. Detergents, in particular sodium dodecyl sulfate 
(SDS), are frequently added to aqueous solvents to 
maintain protein solubility. The compatibility with SDS 
is a big advantage of SDS polyacrylamide gel electro- 
phoresis (SDS-PAGE) over other protein separation 
techniques. Thus, SDS-PAGE and two-dimensional gel 
electrophoresis, which also uses SDS and other deter- 
gents, are the most general and preferred methods for 
the purification of small amounts of proteins, provided 
that activity does not necessarily need to be maintained. 
Lastly, the number of proteins in a given cell system is 
typically in the thousands. Any attempt to identify and 
categorize all of these must use methods which are as 
rapid as possible to allow completion of the project 
within a reasonable lime frame. Therefore, a successful, 
general proteomics technology requires -high sensitivity, 
high throughput, the ability to difTerenliate differentially 
modified proteins, and the ability to quantitatively dis- 
play and analyze all the proteins present in a sample. 

3.2 2-D electrophoresis ~ mass spectrometo': a common 
implementation of proteome analysis 

The most common currently used implementation of 
proteome analysis technology is based on the separation 
of proteins by two-dimensional (lEF/SDS-PAGE) gel 
electrophoresis and their sul^sequenl identification and 
analysis by mass spectrometry (MS) or tandem mass 
spectrometry (MS/MS), in 2-DE, proteins are first separ- 
ated hy isoelectric focusing (lEF) and then by SDS- 
f*AGE. in the second, perpendicular dimension. Separ- 
ated proteins are visualized at high sensitivity by staining 
or autoradiography, producing two-dimensional arrays of 
proteins. 2-DE gels arc, at present, the nuist commonly 
used means of global display of proteins in complex 
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samples. The separation of thousands of proteins has 
been achieved in a single gel f24, 25) and differentially 
modified proteins are frequently separated. Due to the 
compatibility of 2-DE with high concentrations of deter- 
gents, protein denaturanls and other additives promoting 
protein solubility, the technique is widely used. 

The second step of this type of proteome analysis is the 
identification and analysis of separated proteins. Individ- 
ual proteins from polyacrylamide gels have traditionally 
been identified using AMerminal sequencing {26, 27], 
internal peptide sequencing (28, 29), immunoblotting or 
comigration with known proteins (30). The recent dra- 
matic growth of large-scale genomic and expressed 
sequence tag (EST) sequence databases has resulted in a 
fundamental change in the way proteins are identified by 
their amino acid sequence. Rather than by the traditional 
methods described above, protein sequences are now fre- 
quently determined by correlating mass spectral or 
tandem mass spectral data of peptides derived from pro- 
teins, with the information contained in sequence data- 
bases 131-33). 

There are a number of alternative approaches to pro- 
teome analysis currently under development. There is 
considerable interest in developing a proteome analysis 
stragegy which bypasses 2-DE altogether, because it is 
considered a relatively slow and tedious process, and 
because of perceived difficulties in extracting proteins 
from the gel matrix for analysis. However, 2-DE as a 
starting point for proteome analysis has many advan- 
tages compared to other techniques available today. The 
most significant strengths of the 2-DE-MS approach 
include the relatively uniform behavior of proteins in 
gels, the ability to quantify spots and the high resolution 
and simultaneous display of hundreds to thousands of 
proteins within a reasonable time frame. 

A schematic diagram of a typical procedure of the identi- 
fication of gel-separated proteins is shown in Fig. 2. Pro- 
tein spots detected in the gel are enzymatically or chemi- 
cally fragmented and the peptide fragments are isolated 
for analysis, as already indicated, most frequently by MS 
or MS/MS. There are numerous protocols for the gener- 
ation of peptide fragments from gel-separated proteins. 
They can be grouped into two categories, digestion in 
the gel slice [28. 34] or digestion after eleclrolransfer out 
of the gel onto a suitable membrane ((29, 35-37] and 
reviewed in (381). In most instances either technique is 
applicable and yields good results. The analysis of MS or 
MS/MS data is an important step in the whole process 
because MS instruments can generate an enormous 
amount of information which cannot easily be managed 
manually. Recently, a number of groups have developed 
software systems dedicated to the use of peptide MS 
and MS/MS spectra for the idenlincation of proteins. 
Proteins are identified by correlating the information 
contained in the MS spectra of protein digests or 
MS/MS spectra of individual peptides with data con- 
tained in DNA or protein sequence databases. 

Tlie systems we are currently using in our laboratory are 
based on the separation of the peptides contained in pro- 
tein digests by narrow bore or capillary liquid chromatog- 
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Figure 2. Schematic diagram of a procedure for identiflcation of gel- 
separated proteins. Peptides can either be separated by a technique 
such as LC or CE, or infused as a mixture and sorted in the MS. Data- 
base searching can either be performed on peptide masses from an 
MS spectrum, peptide fragment masses from CID spectra of peptides^ 
a combination of both. 
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raphy (39, 40) or capillary electrophoresis [41), the anal- 
ysis of the separated peptides by electrospray ioniza- 
tion (ESI) MS/MS, and the correlation of the generated 
peptide spectra with sequence databases using the 
SEQU EST program developed at the University of Wash- 
ington [32, 33). The system automatically performs the 
foilowing operations: a particular peptide ion character- 
ized by its mass-to-charge ratio is selected in the MS out 
of all the peptide ions present in the system at a parti- 
cular time; the selected peptide ion is collided in a colli- 
sion cell wiih argon (collision-induced dissociation, 
CID) and the masses of the resulting fragment ions are 
determined in the second sector of the tandem MS; this 
experimentally determined CID spectrum is then corre- 
lated with the CID spectra predicted from all the pep- 
tides in a sequence database which have essentially the 
same mass as the peptide selected for CID; this correla- 
tion matches the isolated peptide with a sequence seg- 
ment in a database and thus identifies the protein from 
which the peptide was derived. There are a number of 
alternative programs which use peptide CID spectra for 
protein identification, but we use the SEQU EST system 
because it is currently the most highly automated pro- 
gram and has proven to be successful, versatile and 
robust. 



3.3 Protein identincation by LC-MS/MS, capillary 
LC-MS/MS and CE-MS/MS 

it has been demonstrated repeatedly that MS has a very 
high intrinsic sensitivity. For the routine analysis of gel- 
separated proteins at high sensitivity, the most signif- 
icant challenge is the handling of small amounts of 
sample. The crux of the problem is the extraction and 
transferal of peptide mixtures generated by the digestion 
of low nanogram amounts of protein, from gels into the 
MS/MS system without significant loss of sample or 
iniroduclion of unwanted contaminants. We employ 
'hree difTcri-ni sysicms lor introducing gel-purified sam- 
ples into an MS, depending on the level of sensitivity 



required. As an approximate guideline, for samples con- 
taining tens of picomoles of peptides, L€-MS/MS is 
most appropriate; for samples containing low picomole 
amounts to high femtomole amounts we use capillary 
LC-MS/MS; and for samples containing femtomoles or 
less, CE-MS/MS is the method of choice. 

33.1 LC-MS/MS 

The coupling of an MS to an HPLC system using a 
0.5 mm diameter or bigger reverse phase (RP) column 
has been described in detail 142]. This system has several 
advantages if a large number of samples are to be ana- 
lyzed and all are available in sufficient quantity. The 
LC-MS and database searching program can be run in a 
fully automated mode using an autosampler, thus maxi* 
mizing sample throughput and minimizing the need for 
operator interference. The relatively large column is 
tolerant of high levels of impurities from either gel prep- 
aration or sample matrix. Lastly, if configured with a 
flow-splitter and micro-sprayer [40], analyses can be per- 
formed on a small fraction of the sample (less than 5%) 
while the remainder of the sample is recovered in very 
pure solvents. This latter feature is particularly useful 
when an orthogonal technique is also used to analyze 
peptide fractions, such as scintillation of an introduced 
radiolabel, and this data can be correlated with peptides 
identified by CID spectra. 

» 

3.3.2 Capillary LC-MS 

An increase of sensitivity of approximately tenfold can be 
achieved by using a capillary LC system with a 100 pm ID. 
column rather than a 0.5 mm ID column as referred to 
above. Since very low flow rates are required for such 
columns, most reports have used a precolumn flow split- 
ting system for producing solvent gradients. We have 
recently desribed the design and construction of a novel 
gradient mixing system which enables the formation 
of reproducible gradients at very low flow rates (low 
nt/min) without the need for How splitting (A. Ducret 
et al., submitted for publication). Using this capillary 
LC-MS/MS system we were able to identify gel-separat- 
ed proteins if low picomole to high femtomole amounts 
were loaded onto the gel [40]. This system is as yet not 
automated and, like all capillary LC systems, is prone to 
blockage of the columns by microparticulates when ana- 
lyzing gel-separated proteins. 

3.3.3 CE-MS/MS 

The highest level of sensitivity for analyzing gel-sep- 
arated proteins can be achieved by using capillary elec- 
trophoresis — mass spectrometry (CE-MS). We have de- 
scribed in the past a solid-phase extraction capillary elec- 
trophoresis (SPE-CE) system which was used with triple 
quadrupole and ion trap ESl-MS/MS systems for the 
identification of proteins at the low femtomole to sub- 
femtomole sensitivity level (43, 44 j. While this system is 
highly sensitive, its operation is labor-intensive and its 
operation has not been automated. In order to devise an 
analytical system with both the sensitivity of a CE and 
the level of automation of LC, we have constructed 
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Figure 3. Schcmalic illuslralion of a 
microfabricated analytical system for C£, 
consisting of a micromachined device, 
coated capillary clectroosmolic pump, 
and microelectrospray mierface. The 
dimensions of the channels and reservoir 
are as indicated in the text. The channels 
on the device were graphically enhanced 
to make them more visible. Reproduced 
from [451, ^'^h permission. 



microfabricated devices for the introduction of samples 
into ESI-MS for high-sensitivity peptide analysis. 

The basic device is a piece of glass into which channels 
of 10-30 urn in depth and 50-70 iim in diameter are 
etched by using photolithography/etching techniques 
similar to the ones used in the semiconductor industry. 
(A simple device is shown in Fig. 3). The channels arc 
connected lo an external high voltage power supply 1451- 
Samples are manipulated on the device and off the 
device to the MS by applying different potentials to the 
reservoirs. This creates a solvent flow by electroosmotic 
pumping which can be redirected by changing the posi- 
tion of the electrode. Therefore, without the need for 
valves or gates and without any external pumping, the 
flow can be redirected by simply switching the position 
of the electrodes on the device. The direction and rate of 
the flow can be modulated by the size and the polarity 
of the electric field applied and also by the charge state 
of the surface. 

The type of data generated by the system is illustrated in 
Fig. 4, which shows the mass spectrum of a peptide sample 
representing the tryptic digest of carbonic anhydrase at 
290 fmol/^L. Each numbered peak indicates a peptide suc- 
cessfully identified as being derived from carbonic an- 



hydrase. Some of the unassigned signals may be chemical 
or peptide contaminants. The MS is programmed to auto- 
matically select each peak and subject the peptide to CID. 
The resulting CID spectra are then used to identify the 
protein by correlation with sequence databases. Therefore, 
this system allows us to concurrently apply a number of 
protein digests onto the device, to sequentially mobilize 
the samples, to automatically generate CID spectra of 
selected peptide ions and lo search sequence databases 
for protein identification. These steps are performed auto- 
matically without the need for user input and proteins can 
be identified at very low femtomolc level sensitivity at a 
rate of approximately one protein per 15 min. 

3.4 Assessment of 2-DE-MS proteome technology 

Using a combination of the analytical techniques de- 
scribed above we have identified the 80 protein spots 
indicated in Fig. 5. The protein pattern was generated by 
separating a total of 40 microgram of protein contained 
in a total cell lysate of the yeast strain YPH499 by high 
resolution 2-DE and silver staining of the separated pro- 
teins. To estimate how far this type of proteome analysis 
can penetrate towards the identification of low abun- 
dance proteins, we have calculated the codon bias of the 
genes encoding the respective proteins. Codon bias is a 
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figure 4. MS spectrum of a iryplic digest 
of carbonic anhydiase using Che microfa- 
bricated system shown In Fig. 3. 290 
fmol/ML of carbonic anhydrase iryptic 
digest was infused into a Finnig.in 1.CQ 
ion trap MS. Each peak was selected for 
CID, and those which wcic identified as 
containing peptides derived from car- 
bonic anhydrase arc numbered. Repro- 
duced from |45|. wiih permission. 
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Figure 5. 2-DE separation of a lysaie of yeasi cells, with identified proteins highlighted. The first dimension of separation was an IPG from 
pH 3-10, and ibc second dimension was a 10%T SDS-PAGE gel. Proteins were visualized by silver staining. Further details of experimental 
procedures arc included in S. P. Gygi et aL (submiiicd). 
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calculaled measure of the degree of redundancy of trip- 
let DNA codons used to produce each amino acid in a 
particular gene sequence. It has been shown to be a 
useful indicator of the level of the protein product of a 
particular gene sequence present in a cell [46]. The gen- 
eral rule which applies is that the higher the value of the 
codon bias calculated for a gene, Ihe more abundant the 
protein product of that gene becomes. The calculated 
codon bias values corresponding to the proteins identi- 
fied in Fig. 5 are shown in Fig. 6b. Nearly all of the pro- 
leins identified (> 95%) have codon bias values of > 0.2, 
indicating ihcy arc highly abundant in cells. In contrast, 
codon bias values calculated for the entire yeasi genome 
{Fig. 6a) show that the majority of proleins present in 
the proteome have a codon bias of < 0.2 and are Ihus of 
low abundance. 

This fi nding is of considerable importance in our assess- 
ment of the cunenl status of proteome analysis technol- 
ogy. I( is clear that even using highly sensitive analytical 
techniques, we are only able to visualize and idenlify Ihe 



more abundant proteins. Since many important regula> 
lory proteins are present only at low abundance, these 
would not be amenable to analysis using such tech- 
niques. This situation would be exacerbated in the anal- 
ysis of proteomes containing many more proteins than 
the approximately 6000 gene products present in yeast 
cells 116). In the analysis of, for example, the proteome 
of any human cells, there are potentially 50000-100 000 
gene products [47|. Inherent limitniions on the amount 
of protein that can be loaded on 2-DE, and the number 
of components that can be resolved, indicate thai only 
the most highly abundant fract ion of the many gene 
products could be successfully analyzed. One approach 
thai has been empfoyed to circumvent these limitations 
is the use of very narrow range immobilized pH gradient 
sirips for the first-dimension separation of 2-DE 148). 
Since only those proteins which focus within the narrow 
range will enter the second dimension of separation, a 
much higher sample loading within Ihc desired range is 
pos.sible. This, in turn, can lead to the visualization and 
identification of less abundant proleins. 
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Figure 6. Calculated codon bias values for yeast proteins. (A) Distribu- 
tion of calculated values for the entire yeast proicome. (B) Distribu- 
tion of calculated values for the subset of 80 identified proteins also 
shown in Figs. I and 5. Further details of experimental procedures are 
included in 5. P. Gygi et ai (submitted). 



4 Utility of proteome analysis for biological 
research 

For the success of proteomics as a mainstream approach 
to the analysis of biological systems it is essential to 
define how proteome analysis and biological research 
projects intersect. Without a clear plan for the implemen- 
tation of proieome-lype approaches into biological re- 
search projects the full impact of the technology can not 
be realized. The literature indicates that proteome anal- 
ysis is used both as a database/data archive, and as a bio- 
logical assay or biological research tool. 

4.1 The proteome as a database 

The use of proteomics as a database or data archive 
essentially entails an attempt to identify all the proteins 
in a cell or species and to annotate each protein with the 
known biological information that is relevant for each 
protein. The level of annotation can. of course, be exten- 
sive. Hie most common implementation of this idea is 
the separation of proteins by high resolution 2-DE, the 
identincation of each detected protein spot and the 
annolaiion of the protein spots in a 2-DE gel database 
formal. Tliis approach is complicated by the fact that it is 
difTiculi to precisely define a proteome and to decide 
which proicome should be rcpresenied in the database. 
In conlrasi (o the genome of a species, which is essen- 
tially siatic, the proteome is highly dynamic. I'rocesses 
such as difi'erenlialion. cell activation' and disease can all 
significinily chang.c ihe proleomc of a species. Tliis is 
illuslraled in l ig. 7. Tlie figure shows two high-resolu- 



tion 2-DE maps of proteins isolated from rat serum. 
Fig. 7A is from the serum of normal rats, while Fig. 7B 
is from the serum of rats in acute-phase serum after 
prior treatment with an inflammation-causing agent (49). 
It is obvious that the protein patterns are significantly 
different in several areas, raising the question of exactly 
which proteome is being described. 

« 

Therefore, a comprehensive proteome database of a spe- 
cies or cell type needs to contain all of the parameters 
which describe the state and the type of the cells from 
which the proteins were extracted as well as the software 
tools to search the database with queries which reflect 
the dynamics of biological systems. A comprehensive 
proteome database should be capable of quantitatively 
describing the fate of each protein if specific systems 
and pathways are activated in the cell. Specifically, the 
quantity, the degree of modification, the subcellular loca- 
tion and the nature of molecules specifically interacting 
with a protein as well as the rate of change of these 
variables should be described. Using these admittedly 
stringent criteria, there is currently no comlete proteome 
database. A number of such databases are, however, in 
the process of being constructed. The most advanced 
among them, in our opinion, are the yeast protein data- 
base YPD (501 (accessible at http.://www.ypd.com) and 
the human 2D-PAGE databases of the Danish Centre 
for Human Genome Research [12] (accessible at http:// 
biobase.dk/cgi-bin/cetis). While neither can be con- 
sidered complete as not all of the potential gene pro- 
ducts are identified, both contain extensive annotation 
of supplemental information for many of the spots 
which are positively identified in reference samples. 

4.2 The proteome as a biological assay 

The use of proteome analysis as a biological assay or 
research tool represents an alternative approach to inte- 
grating biology with proteomics. To investigate the state 
of a system, samples are subjected to a specific proceess 
that allows the quantitative or qualitative measurement 
of some of the variables which describe the system. In 
typical biochemical assays one variable {e.g.^ enzyme 
activity) of a single component (e.g., a particular en- 
zyme) is measured. Using proteomics as an assay, mul- 
tiple variables (e.g., expression level, rate of synthesis, 
phosphorylation stale, etc.) are measured concurrently 
on many (ideally all) of Ihe proteins in a sample. The 
use of proteomics as an assay is a less far-reaching prop- 
osition than the construction of a comprehensive pro- 
teome database. It does, however, represent a pragmatic 
approach which can be adapted to investigate specific 
systems and pathways, as long as the interpretation of 
the results takes into account that with current technol- 
ogy not all of the variables which describe the system 
can be observed (see Section 3.4). 

A common implementation of proteome analysis as a 
biological assay is when a 2-Dl£ protein pattern gener- 
ated from the analysis of an experimental sample is 
compared to an array of reference patterns representing 
different states of the system under invesligation. The 
stale of the experimental system at the litne Ihe sample 
was generated is therefore determined by the quantita- 
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live comparative analysis of hundreds to a few thousand 
proteins. Comparative analysts of the 2-DE patterns fur- 
thermore highlights quantitative and qualitative differ- 
tnces in the protein profiles which correlate with the 
slate of the system. For this type of analysis it is not 
essential that all the proteins are identified or even visu- 



alized» although the results become more informative as 
more proteins are compared. It is obvious, however, that 
the possibility to identify any protein deemed character- 
istic for a particular state dramatically enhances this 
approach by opening up new avenues for experimenta- 
tion. 
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Proieome analysis as a biological assay has been success- 
fully used in the field of toxicology, to characterize 
disease stales or to study differential activation of cells. 
The approach is limited, of course^ by the fact that only 
the visible protein spots are included in the assay, and it 
is well known that a substantial but far from complete 
fraction of cellular proteins are detected if a total -cell 
lysaie is separated by 2-DE. Proteins may not be 
detected in 2-DE gels because they are not abundant 
enough to be visualized by the detection method used, 
because they do not migrate within the boundaries (size, 
p/) resolved by the gel, because they are not soluble 
under the conditions used, or for other reasons. 

A different way to use proteome analysis as a biological 
assay to defme the state of a biological system is to take 
advantage of the wealth of information contained in 
2-DE protein patterns. 2-DE is referred to as two-dimen- 
sional because of the electrophoretic mobility and the 
isoelectric points which define the position of each pro- 
tein in a 2-DE pattern. In addition to the two dimen- 
sions used to generate the protein patterns, a number of 
additional data dimensions are contained in the protein 
patterns. Some of these dimensions such as protein 
expression level, phosphorylation state, subcellular loca- 
tion, association with other proteins, rale of synthesis or 
degradation indicate the activity state of a protein or a 
biological system. Comparative analysis of 2-DE protein 
patterns representing different states is therefore ideally 
suited for the detection, identification and analysis of 
suitable markers. Once again it must be emphasized that 
in this type of experiment only a fraction of the cellular 
proteins is analyzed. Since many regulatory proteins are 
of low abundance, this limitation is a concern, particu- 
larly in cases in which regulatory pathways are being 
investigated. 

5 Concluding remarks 

In this report we have addressed three main issues 
related to proteome analysis. First, we have discussed 
the rationale for studying proteomesr Second, we have 
assessed the technical feasibility of analyzing proteomes 
and described current proteome technology, and third, 
we have analyzed the utility of proteome analysis for bio- 
logical research. It is apparent that proteome analysis is 
an essential tool in the analysis of biological systems. 
The multi-level control of protein synthesis and degrada- 
tion in cells means that only the direct analysis of 
mature protein products can reveal their correct identi- 
ties, their relevant stale of modification and/or associa- 
tion and their amounts. Recently developed methods 
have enabled the identification of proteins at ever- 
increasing sensitivity levels and at a high level of auto- 
mation of the analytical processes. A number of tech- 
nical challenges, however, remain. While it is currently 
possible to identify essentially any protein spots that can 
be visualized by common staining methods, il is ap- 
parenl th.ii without prior enrichment only a relatively 
small and highly selected population of long-lived, 
highly expressed proteins is observed. There arc many 
mure proteins in a given cell which are not visualized by 
such methods. Frequently il is the low abundance pro- 
teins thai execute key regulatory functions. 



We have outlined the two principal ways proteome anal- 
ysis is currently being used to intersect with biological 
research projects: the proteome as a database or data 
archive and proteome analysis as a biological assay. Both 
approaches have in common that at present they are con- 
ceptually and technically limited. Current proteome data- 
bases typically are limited to one celt type and one state 
of a cell and therefore do not account for the dynamics 
of biological systems. The use of proteome analysis as a 
biological assay can provide a wealth of information, but 
it is limited to the proteins detected and is therefore not 
truly proteome-wide. These limitations in proteomics are 
to a large extent a reflection of the fact that proteins in 
their fully processed form cannot easily be amplified and 
are therefore difficult to isolate in amounts sufficient for 
analysis or experimentation. The fact that to date no 
complete proteome has been described further attests to 
these difficulties. With continued rapid progress in pro- 
tein analysis technology, however, we anticipate that the 
goal of complete proteome analysis will eventually 
become attainable. 
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effictendy than gene B.lh\s allows the 
amount of protein A in the cell to be 
much greater than that of protein B. 
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FROM DNATO RNA 

Itanscription and translation are the means by which cells read out> or express, 
the genetic instructions in their genes. Because many identical RNA copies can 
be made from the same gene, and each RNA molectde can direct the synthesis 
of many identical protein molecules, cells can synthesize a large amotmt of 
protein rapidly when necessary. But each gene can also be transcribed and 
translated with a different efficiency, allowing the cell to make vast quantities of 
some proteins and tiny quantities of others (Figure 6-3). Moreover, as we see in 
the next diapter, a cell can change (or regulate) the expression of each of its 
genes according to the needs of the moment— most obviously by controlling 
the production of its RNA. 



Portions of DNA Sequence Are Transcribed into RNA 

The iirst step a cell takes in reading out a needed part of its genetic instructions 
is to copy a particular portion of its DNA nucleotide sequence— a gene— into an 
RNA nucleotide sequence. The information in RNA, although copied into another 
chemical form, is still written in essentially the same language as it is in DNA — 
the language of a nucleotide sequence. Hence the name transcription. 

Like DNA, RNA is a linear polymer made of four different types of nucleotide 
subunits linked together by phosphodiester bonds (Figure 6-4). It differs from 
DNA chemically in two* respects: (1) the nucleotides in RNA are 
ribonucleotides— that is, they contain the sugar ribose (hence the name ribonu- 
cleic add) rather than deoxyribose; (2) although, like DNA, RNA contains the 
bases adenine (A), guanine (G), and cytosine (C), it contains the base uracil (U) 
instead of the thymine (T) in DNA. Since U, like T, can base-pair by hydrogen- 
bonding with A (Figure 6-5), the complementary base-pairing properties 
described for DNA in Chapters 4 and 5 apply also to RNA (in RNA, G pairs with 
C, and A pairs with U). It is not imcommon, however, to find other types of base 
pairs in RNA: for example, G pairing with U occasionally. 

Despite these small chemical differences, DNA and RNA differ quite dra- 
matically in overall structure. Whereas DNA always occurs in cells as a double- 
stranded helix, RNA is single-sUanded. RNA chains therefore fold up into a 
variety of shapes, just as a polypeptide chain folds up to form the final shape of 
a protein (Figure 6-6). As we see later in this chapter, the ability to fold into com- 
plex three-dimensional shapes allows some RNA molecules to have suuctural 
and catalytic functions. 



Transcription Produces RNA Complementary to 
One Strand of DNA 

AU of the RNA in a cell is made by DNA transcription, a process that has cer- 
tain similarities to the process of DNA replication discussed in Chapter 5. 
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Figure 6-89 Protein aggregates that cause human disease. (A) Schematic illustration of the type of 
conformational change In a protein that produces material for a cross-beta filament. (B) Diagram illustrating 
the seK-infiectious nature of the protein aggr^tlon that is central to prion diseases. PrP is highly unusual 
because the misfblded version of the prot«n» called PrP*, induces the nbrmal PrP protein it contacts- to 
change its conformation, as shown. Most of the human diseases caused by protein abnegation are caused by 
the overproduction of a variant protein that b especially prone to aggregation, but because this structure is 
not Infectious in this w^, it cannot spread from one animal to another. (C) Drawii^ of a cross-beta filament, 
a common type of protease-resistant protein aggregate found in a variety of human neurological diseases. 
Because the h/drogen-bond interactions in a P sheet form between polypeptide backbone atoms (see Rgure 
3-9), a number of different abnormally folded proteins can produce this structure. (D) One of several 
possible models for the conversion of PrP to PrP*, showing the likely change of two a-helices into four 
p-s^fands. Although the structure of the normal protein has been determined accurately, the structure of the 
infectious form is not yet known with certainty because the aggregation has prevented the use of standard 
structural techniques. (C, courtesy of Louise Serpell. adapted from M. Sunde et al..J. Mol BioL 273:729-739, 
1 997; D, adapted.from S.B- Prusiner, Trends Brochcm. SdL 2 1 :482-487, 1 996.) 

animals and humans. It can be dangerous to eat the tissues of animals that con- 
tain PrP*, as witnessed most recently by the spread of BSE (commonly referred 
to as the "mad cow disease") from catde to humans in Great Britain. 

Fortunately, in the absence of PrP*» PrP is extraordinarily difficult to convert 
to its abnormal form. Although very few proteins have the potential to misfold 
into an infectious conformation^ a similar transformation has been discovered 
to be the cause of an othervtise mysterious ''protein-only inheritance** observed 
in yeast cells. 

* 

There Are Many Steps From DNA to Protein 

We have seen so far in this chapter that many different types of chemical reac- 
tions are required to produce a properly folded protein from the information 
contained in a gene (Figure 6-90). The final level of a properly folded protein in 
a cell therefore depends upon the efficiency with which each of the many steps 
is performed. 

We discuss in Chapter 7 that ceils have the ability to change the levels of 
their proteins according to their needs. In principle, any or all of the steps in Fig- 
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Figure 6-90 The production of a 
protein by a eucaryotic ceU-The final 
level of each protein in a eucaryotic cdl 
depends upon the efftdency of each step 
depicted. 




COOH 



lire 6-90) could be regulated by the ceU for each individual protein. However, as 
we shall see in Chapter 7, the initiation of transcription is the most conunon 
point for a ceU to regulate the expression of each of its genes. This makes sense, 
inasmuch as the most efficient way to keep a gene from being expressed is to 
block the very first step — ^the transcription of its DNA sequence into an RNA 
molecule. 

Summary 

The translation of the nucleotide sequence of an mRNA molecule into protein takes 
place in the cytoplasm on a large ribonudeoprotein assembly called a ribosome. The 
amino adds used for protein synthesis are first attached to a family of tRNA 
molecules, each of which recognizes, by complementary base-pair interactions, par- 
ticular sets of three nucleotides in the mRNA (codonsh The sequence of nucleotides in 
the mRNA is then read from one end to the other in sets of three according to the 
genetic code. 

To initiate translation, a small ribosomal subunit binds to the mRNA molecule 
at a start codon (AUG) that is recognized by a unique initiator tRNA molecule, A 
large ribosomal subunit binds to complete the ribosome and begin the elongation 
phase of protein synthesis. During this phase, aminoacyl tRNAs — each bearing a 
specific amino acid bind sequentially to the appropriate codon in mRNA by forming 
complementary base pairs with the tRNA anticodon. Each amino acid is added to the 
C-terminal end of the growing polypeptide by means of a cycle of three sequential 
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Rgure 7-5 Six steps at which 
eucaryotic gene expression can be 
controlled* Controls that operate at 
steps I through 5 are discussed ^.this 
chapter. Step 6, the regMbtion of protein 
activity, includes reversible activation or 
inactlvation by protein phosphorylation 
(discussed in Chapter 3) as v/eli as 
irreversible inactivation by proteolytic 
degradation (<fiscu$$ed in Chapter 6). 



Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protein 

K differences among the various cell types of an organism depend on the partic- 
ular genes that the cells express, at what level is the control of gene eaqiression 
exercised? As we saw in the last chapter, there are many steps in the pathway 
leading from DNA to protein, and all of them can in principle be regulated. Thus 
a cell can control the proteins it makes by (1) controlling vAien and how often a 
given gene is transcribed (traiiscriptional controll, (2) controlling how the RNA 
transcript is spliced or otherwise processed (RNA processmg control), (3) 
selecting which completed mRNAs in the ceil nucleus are exported to the cytosol 
and determining where in the cytosol they are localized (RNA transport and 
localization control), (4) selecting which mRNAs in the cytoplasm are translated 
by ribosomes (translational control), (5) selectively destabilizing certain mRNA 
molecules in the cytoplasm (mRNA degradation control), or (6) selectively acti- 
vating, inactivating, degrading, or compartmentalizing specific protein 
molecules after they have been made (protein activity control) (Figure 7-5). 

For most genes transcriptional controls are paramount. This makes sense 
because, of all the possible control points illustrated in Figure 7-5, only tran- 
scriptional control ensures that the cell will not synthesize superfluous interme- 
diates. Jn the following sections we discuss the DNA and protein components 
that perform this function by regulating the initiation of gene transcription. We 
shall return at the end of die chapter to the additional ways of regulating gene 
expression. 



Summary 

The genome of a cell contains in its Dm sequence the information to make many 
thousands of different protein and UNA molecules. A ceU typically expresses only a 
fraction of its genes, and the different types of cells in multicellular organisms arise 
b&xiuse different sets of genes are expressed. Moreover, cells can change the pattern 
of genes they express in response to changes in their environment, such as signals 
from other cells. Although all of the steps involved in expressing a gene can in prin- 
<^ple he regulated, for most genes the initiation of RNA transcription is the most 
, important point of control 



DNA-BINDING MOTIFS IN GENE REGULATORY 
PROTEINS 

How does a ceU determine which of its thousands of genes to transcribe? As 
mentioned briefly in Chapters 4 and 6, the transcription of each gene is con- 
^oued by a regulatory region of DNA relatively near the site where transcription 
oegms. Some regulatory regions are simple and act as switches that are tiuown 
:0y a smgle signal. Many otiiers are complex and act as tiny microprocessors, 
responding to a variety of signals tiiat they interpret and integrate to switch the 
neighbormg gene on or off. Whetiier complex or simple, tiiese switching devices 

* 



-BINDING MOTIFS IN GENE REGULATORY PROTEINS 



occur in the genu line, the cell lineage that gives rise to sperm or eggs. Most of 
the DNA in vertebrate germ cells is inactive and highly methylated. Over long 
periods of evolutionary time, the methylated CG sequences in these inactive 
regions have presumably been lost through spontaneous deamination events 
that were not properly repaired. However promoters oif genes that remain active 
in the germ cell lineages * (including most housekeepmg genes) are kept 
unraethylated, and therefore spontaneous deaminations of Cs that occur with- 
in them can be accurately repaired. Such regions are preserved in modem day 
vertebrate cells as CG islands. In addition, any mutation of a CG sequence in the 
genome that destroyed the function or regulation of a gene in the adult would be 
selected against, and some CG islands are simply the result of a higher than nor- 
mal density of critical CG sequences. 

The manmnalian genome contains an estimated 20,000 CG islands. Most of 
the islands mark the 5' ends of transcription units and thus, presumably, of 
genes. The presence of CG islands often provides a convenient way of identily- 
ing genes in the DNA sequeijices of vertebrate genomes. 

m 

Summary 

The many types of cells in animals and plants are created largely through mecha- 
nisms that cause different g^nes to be transcribed in different ceUs, Since many 
specialized animal cells can -maintain their unique character through many cell 
division cycles and even when grown in culture, the gene regulatory mechanisms 
involved in creating them must be stable once established and heritable when the 
ceU'divides. These features endow the cell unth a memory of its developmental history. 
Bacteria and yeasts provide unusually accessible model systems in which to study 
gene regulatory mechanisms. One such mechanism involves a competitive interac- 
tion between two gene regulatory proteins, each of which inhibits the synthesis of the 
other; this can create a flip-flop switch that switches a cell between two alternative 
patterns of gene expression. Direct or iruiirect positive f Redback loops, which enable 
gene regulatory proteins to perpetuate their own synthesis, provide a general mech- 
anism for cell memory. Negative feedback loops with programmed delays form the 
basis for cellular clodcs. 

In eucaryotes the transcription of a gene is generally controlled by combinations 
of gene regulatory proteins. It is thought that each type of cell in a higher eucaryotic 
organism contains a specific combination of gene regulatory proteins that ensures 
the expression of only those genes appropriate to that type of cell A given gene regu- 
latory protein may be active in a variety of circumstances and typically is involved 
in the regulation of many genes. 

In addition to diffusible gene regulatory proteins, inherited states of chromatin 
condensation are also used by eucaryotic cells to regulate gene expression. An espe- 
cially dramatic case is the inactivation of an entire X chromosome in female mam- 
mals. In vertebrates DNA methylation also functions in gene regulation, being used 
mainly as a device to reinforce decisions about gene expression that are made ini- 
tially by other mechanisms. DNA methylation also underlies the phenomerwn of 
genomic irriprinting in mammals, in which the expression of a gene depends on 
whether it was inherited from the mother or the father. 



VERTEBRATE ANCESTOR DNA 



nnnilj 



RNA 



methylation of 
most CG sequences 
in germ line 




many millions of years 
of evolution 



VERTEBRATE DNA 



L 



C6 island 



Figure 7-86 A mechanism to explain 
both the nruttlced overall deficiency 
of CG sequences and th^r clustering 
into CG islands In. vertebrate 
genomes. A 6/ock Sne marks the bcation 
of a CG dinucleotide in the DNA 
sequence* while a red "lollipop'* indicates 
the presence of a methyl ^oup on the 
CG dinucleotide. CG sequences that lie In 
regulatory sequences of genes that are 
transcribed in germ cells are unmethylated 
and therefore tend to be retained in 
evolution. Methylated CG sequences, on 
the o^er hand, tend to be lost through 
deamination of 5-methyi C toT. unless the 
CG sequence is critical for survival. 



POSTTRANSCRIPTIONAL CONTROLS 

In principle, every step required for the process of gene expression could be 
controlled. Indeed, one can find examples of each type of regulation, although 
any one gene is likely to use only a few of them. Controls on the initiation of 
gene transcription are the predominant form of regulation for most genes. But 
other controls can act later in the pathway from DNA to protein to modulate 
the amount of gene product that is made. Although these pbsltranscriptional 
controls, which operate after RNA polymerase has bound to the gene's promoter 
and begun RNA synthesis, are less common than transcriptional control, for 
niany genes they are crucial. 



POSTTRANSCRIPTIONAL CONTROLS 
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CHAPTER 29 



Regulation of transcription 



Tlie plienoh-ptc dilTerf iices lhal ciislin^uish ihe 
.t-arious kinds of rells in a hi':her eiikarxole are 
largely due lo flifTerences in ihe expression of 
renes thai code for prtiieins. that is. those Iran- 
.crrtbed by RXA polymerase 11. In principle, the 
expression of these ^lenes niijrhi be repulaied at 
any one of several singes. The concept of the 
-level of conlror implies ihai «£ene expression 
is nol necessarily an automatic process once it 
has begun. It could be regulated in a gene- 
specific way at any one of several sequential 
jieps- We can dislinguisli (at least) five poten- 
iial control points, forming the series: 

Acttvalion of gem- slniclure 
i 

fnitialiun of Iranscriplion 
1 

Processing the transcnpt 
i 

Transport, to cvtopfasm 
i * 

Translation of inHNA 

Tlie existence <if ihe first su«j) is implied by 
me discover\ thai genes niay exist in either of 
Uvo stniclural conditions. Kel;iiive to the stale 
nf most of the genome, i^ttw^s. are found in 
m ^^cllve- stiate in Ihe cells in which they 
are expressed (see Chnpier 27). The cliange of 
Mructure is distinct from the act of transcrip- 
lion, and indicates that the gene is "Iranscrib- 
abie." This suggests that acquisition of ihe 
^active- structure must be Ihe first step in gene 
expression. 

Transcription of a gene in the active slate is 



conlrolled at the stage of initiation, that is. bv 
the interaction of R.\A polymera.se with its prt>- 
tiioler. This is now becoming susceptible to 
analysis in the in vUro systems (see Chapter 
28). For most genes, this is a mafor control 
point: probably it is the most common level of 
regulation. 

There is at present no evidence for control 
at subsequent stages of transcription in eukar>. 
otic cells, for example, via anlilennination 
mechanisms. 

The pHnian- transcript is modified by capping 
at the S' end, and usually aLso by polyadenyla- 
lion at the 3' end. Introns must be spliced out 
from Ihe Iran.scripts of interrupted genes. The 
mature R.\A must be exported Ironi the nucleus 
lo Ihe cytoplasm. Regulation of gene expression 
by selection of sequences at the levef of nuclear 
RNA might involve any o.r all of these stages, 
bill the one for which we have most evidence 
concerns changes in splicing: some genes are 
expres.sed by n>eans of alternative splicing pot* 
terns who.<ie regulalii>n controls the type of pro- 
tein product (see Chapler 30). ^ 

Rnally, ihe tmnstation of an mRNA In the c>to- 
pla.sm can be s|>ecirically conlrolled. There is liUle 
evidence for the employmeiu of this mechanism in 
adult somatic cells, but it does occur in some 
einboonic situations, as descril)ed in Chapter 7. 
The mechanism is presumed lo involve the block- 
ing of initiation of translation of .some niRNAs by 
specific protein factors. 

But having acknowledged that control of gene 
expression can occur at mulUple stages, and 
that production of RNA cannot inevitably be 
equaled wiih production of protein, it is clear 
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that the ovemhelming Tnajohty or regulatory 
events occur at^the^ iiiltiation of IranscripUon. 
Regulation of tissue-specific gene transcription 
lies at the heart of eukaryotic differentiation; 
indeed* we see examples in Chapter 38 in 
which proteins thai regulate embr>'onic devel- 
opmenl prove to be transcription factors. A reg- 
ulatory transcription factor serves to proAide 



common control of a large number of target 
genes, and we seek to answer two questions 
about this mode of regulaUon: wha.t identifies 
the common target genes to the transcription 

factor; and how is the activity of the transcrip- 
tion factor itself regulated in response to inUin- 
sic or extrinsic signals? 



Response elemenls identify genes under common 
regulation 



The principle that emerges from characterizing 
groups of genes under common control is that 
they' share a pi^inoter element that is recognized 
to' a regulatory transcription factor. An element 
that causes a gene to respond to such a factor 
is called a response element; examples are the 
HSE (heal shodi response element), GRE 
(glucocorticoid response element), SRE (serum 
response element). 

The properties of some inducible transcription 
factors and tlie elemenls thai they recognize are 
summarized in. Table 29.1. Response elements 
have the same general characteristics as 
upstream elemenls. of promoters or enhancers. 
They contain short consensus sequences, and 
copies of the response elements found in dif- 
ferent genes are closely related, but not neces- 
sarily identical. The region bound by the factor 
extends for a short distance on either side of 



Table 29.1 _ Inducible transcription factors bind to 
response elements that identify groups o! prcmolers^ 
Of enhancers subjsct to coordinate control. 



RcQulatory Agent Module Consensus 



Heat shock 
Glucocorticotd 
Pnortjol ester 
Serum 



HSE CNNGAANNTCCNNG 

GRE TGGTACAAATGTTCrr 

TRE TGACTCA 

SRE CCATATTAGG 



Factor 

HSTF 
Receptor 
API 
SRF 



the consensus sequence. In promoters, the ele- 
menls are not present at fixed distances from 
the slartpoint, but are usually <200 bp upstream 
of it- The presence of a single element usually 
is sufficient to confer the regulatory response, 
but sometimes there are multiple copies. 

Response elements may be located in 
■meters or in enhancers. Some types of elements 
are typically found In one rather than the other 
usually an HSE is found in a promoter, while • 
GRE is found in an enhancer. We assume tM 
all response elemenU function by the sa^"^, 
general principle. A gene is regulated tv ^ 
sequence at tlie promoter or enhancer that |* 
recognized by a specific protein. The prvttt» 
Junctions as a transcription factor needed fi^ 
Rl>^A potytnerase to initiate. Active protein ^ 
available only under conditions when the^ g^ 
to be expressed; its absence means that the P^^ 
moter is not activated by this particular ctrf ' 
An example of a situation in which w****^ 
genes are controlled by a single factor is 
\ided by the heal shock response. This is ^^^^ 
mon to a wide range of prokaryoies ^ 
eukar>'Ote5 and involves multiple coniro^^ 
gene expression: an increase in temP^*"^*'^^,, 
lums on transcription of some genes, ^"^^^^^A 
transcription of the heal shock 
causes changes in the translation ol ^ .^js 
The control of the heat shock genes »1**^^ 
the differences berween prokaryotic ^^^^ 
eukaryotic modes of control in. bacteria. ^ 
Sigma factor is synthesized that ^^^^^ j\et^ 
polymerase holoenzyme to recognize 
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Abstract 

Background: Prostate stem cell antigen (PSCA) is a recently defined honriologue of the Thy-l/Ly-6 family of 
glycosylphosphatidylinositol (GPI)-anchored cell surface antigens. The purpose of the present study was to 
examine the expression status of PSCA protein and mRNA in clinical specimens of human prostate cancer (Pea) 
and to validate it as a potential molecular target for diagnosis and treatment of Pea. 

Materials and Methods: Immunohistochemical (IHC) and in situ hybridization (ISH) analyses of PSCA 
expression were simultaneously performed on paraffin-embedded sections from 20 benign prosutic hyperplasia 
(BPH), 20 prostatic intraepithelial neoplasm (PIN) and 48 prosute cancer (Pea) tissues, including 9 androgen- 
Independent prostate cancers. The level of PSCA expression was semi quantitatively scored by assessing both the 
percentage and intensity of PSCA-poVitive staining ceils in the specimens. Then compared PSCA expression 
between BPH. PIN and Pea tissues and analysed the correlations of PSCA expression level with pathological grade, 
clinical stage and progression to androgen-independence in Pea. 

Results: In BPH and low grade PIN, PSCA protein and mRNA staining were weak or negative and less intense 
and uniform than that seen in HGPIN and Pea. There were moderate to strong PSCA protein and mRNA 
expression In 8 of H (72.7%) HGPIN and in 40 of 48 (83,4%) Pea specimens examined by IHC and ISH analyses, 
with statistical significance compared with BPH (20%) and low grade PIN (22.2%) samples (p < 0.05. respectively). 
The expression level of PSCA increased with high Gleason grade, advanced stage and progression to androgen- 
independence (p < 0.05, respectively). In addition. IHC and ISH suining showed a high degree of correlation 
between PSCA protein and mRNA overexpression. 

Conclusions: Our data demonstrate that PSCA as a new cell surface marker is overexpressed by a majority of 
human Pea. PSCA expression correlates positively with adverse tumor characteristics, such as increasing 
pathological grade (poor cell differentiation), worsening clinical stage and androgen-independence. and 
speculatively with prostate carcinogenesis. PSCA protein overexpression results from upregulated transcription 
of PSCA mRNA. PSCA may have prognostic utility and may be a promising molecular urget for diagnosis and 
treatment of Pea. 
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Introduction 

Prostate cancer (Pea) is the second leading cause of can- 
cer-related death in American men and is becoming a 
common cancer inaeasing in China. Despite recently 
great progress in the diagnosis and management of local- 
ized disease, there continues to be a need for new diagnos- 
tic markers that can accurately discriminate between 
indolent and aggressive variants of Pea. There also contin- 
ues to be a need for the identification and charaaerization 
of potential new therapeutic targets on Pea cells. Current 
diagnostic and therapeutic modalities for recurrent and 
metastatic Pea have been limited by a lack of specific tar- 
get antigens of Pea. 

Although a number of prostate-specific genes have been 
identified (i.e. prostate specific antigen, prostatic acid 
phosphatase, glandular kallikrein 2), the majority of these 
are secreted proteins not ideally suited for many immuno- 
logical strategies. So, the identification of new cell surface ' 
antigens is critical to the development of new diagnostic 
and therapeutic approaches to the management of Pea. 

Rciter RE et al ( 1 ] reported the identification of prostate 
stem cell antigen (PSCA), a ceil surface antigen that is pre- 
dominantly prostate specific The PSCA gene encodes a 
123 amino acid glycoprotein, with 30% homology to 
stem cell antigen 2 (Sea 2). Like Sca-2, PSCA also belongs 
to a member of iheThy-l/Ly-6 family and is anchored by 
a glycosylphosphatidylinositol (GPl) linkage. mRNA in 
situ hybridization (ISH) localized PSCA expression in nor- 
mal prostate to the basal cell epithelium, the putative 
stem cell compartment of prostatic epithelium, suggesting 
that PSCA may be a marker of prostate stem/progenitor 
cells. 

In order to examine the status of PSCA protein and mRNA 
expression in human Pea and validate it as a potential 
diagnostic and therapeutic target for Pea, we used immu- 
nohistochemistry (IHC) and in situ hybridization (ISH) 
simultaneously, and conducted PSCA protein and mRNA 
expression analyses in paraffin-embedded tissue speci- 
mens of benign prostatic hyperplasia (BPH, n = 20), pros- 
tate intraepithelial neoplasm (PIN, n - 20) and prostate 
cancer (Pea, n = 48). Furthermore, we evaluated liie possi- 
ble correlation of PSCA expression level with Pea tumori- 
genesis, grade, stage and progression to androgen- 
independenee. 

Materials and methods 

Tissue samples 

All of the clinical tissue specimens studied herein were 
obtained from 80 patients of 57-84 years old by prosiaie- 
etomy, iransurethral resection of prostate (TURP) or biop- 
sies. The patients were classified as 20 cases of BPH, 20 
cases of PIN, 40 eases of primary Pea, including 9 patients 



with recurrent Pea and a history of androgen ablation 
therapy (orchiectomy and/or hormonal therapy), who 
were referred to as androgen-independent prostate can- 
cers. Eight specimens were harvested from these andro- 
gen-independent Pea patients prior to androgen ablation 
treatment. Each tissue sample was cut into two parts, one 
was fuced in 10% formalin for IHC and the other treated 
with 4% paraformaldehyde/0.1 M PBS PH 7.4 in 0.1% 
DEPC for 1 h for ISH analysis, and then embedded in par- 
affin. All paraffin blocks examined were then cut into 5 
\xm sections and mounted on the glass slides specific for 
IHC and ISH respectively in the usual fashion. H&E- 
stained section of each Pea was evaluated and assigned a 
Cleason score by the experienced urological pathologist at 
our institution based on- the criteria of Cleason score |2j. 
The Cleason sums are summarized in Table 1. Clinical 
staging was performed according to jewett-whitmore- 
prout staging system, as shown in Table 2. In the category 
of PIN, we graded the specimens into two groups, i.e. low 
grade PIN (grade I - H) and high grade PIN (HGPIN, 
grade HI) on the basis of literatures |3,4]. 

tmmunohistochemicat (IHC) anatysis 
Briefly, tissue sections were deparaffinized^ dehydrated, 
and subjected to microwaving in 10 mmol/L citrate 
buffer. PH 6.0 (Boshide, Wuhan, China) in a 900 W oven 
for 5 min to induce epitope retrieval., Slides were allowed 
to cool at room temperature for 30 min. A primary mouse 
antibody specific to human PSCA (Boshide, Wuhan, 
China) with a 1:100 dilution was applied to incubate with 
the slides at room temperature for 2 h. Labeling was 
detected by sequentially-adding biotinylated secondary 
antibodies and strepavidin-peroxidase, and localized 
using 3,3'-diaminoben2idine reaction. Sections were then 
counterstained with hematoxylin. Substitution of the pri- 
mary antibody with phosphate-buffered-saline (PBS) 
served as a negative-staining control. 

mRNA in situ hybridization (ISH) 

Five-pm-thiek tissue seaions were deparaffinized and 

dehydrated, then digested in pepsin solution (4 mg/ml in 
3% citric acid) for 20 min at 37.5 'C, and fiirther proc- 
essed for ISH. Digoxigenin-labeled sense and antisense 
human PSCA RNA probes (obtained from Boshide, 
Wuhan, China) were hybridized to the sections at 48*'C 
overnight. The posthybridization wash widi a high strin- 
gency was performed sequentially at 37*" C in 2 x standard 
saline citrate (SSC) for 10 min, in 0.5 x SSC for 15 min 
and in 0.2 ^ SSC for 30 min. The slides were then incu- 
bated to biotinylated mouse anii-digoxigenin antibody at 
37.5'*C for 1 h followed by washing in 1 « PBS for 20 min 
at room temperature, and then lo sirepavidin-peroxidase 
ai 37.5**C for 20 min followed by washing in 1 x PBS for 
15 min at room temperature. Subsequently, the slides 
were developed with diaminobenzidine and then coun- 
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Table I: Correlation of PSCA expression with Gleason score 







Imenslty x 


frequency 




Gleason score 


0-6 {%) 






9(%) 


2-4 
5-7 
8-10 


5(83) 

19(79) 

5(28) 






MIT) 
5(21) 

13(72) 




Table 1: Correlation of PSCA expression with clinical stage 
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terstained with hematoxylin to localize the hybridization 
signals. Sections hybridized with the sense control probes 
routinely did not show any specific hybridization signal 
above background. All slides were hybridized with PBS to 
substitute for the probes as a negative control. 

Scoring methods 

To determine the correlation between the results of PSCIA 
immunostaining and mRNA in situ hybridization, the 
same scoring manners are taken in the present study for 
PSCA protein staining by IHC and VSCA mRNA staining 
by ISH- Each slide was read and scored by two independ- 
ently experienced urological pathologists using Olympus 
BX-41 light microscopes. The evaluation was done in a 
blinded fashion. For each section, five areas of similar 
grade were analyzed semiquaniitatively for the fraction of 
cells staining. Fifty percent of specimens were randomly 
chosen and rescored to determine the degree of interob- 
server and intraobserver concordance. There was greater 
than 95% intra- and inlerobserver agreement. 

The intensity of PSCA expression evaluated microscopi- 
cally was graded on a scale of 0 to 3+ with 3 being the 
highest expression observed {0, no staining; 1+, mildly 
intense; 2+, moderately intense; 3+, severely intense). The 
staining density was quantified as the percentage of cells 
staining positive for PSCA with the primary antibody or 
hybridization probe, as follows: 0 = no staining; 1 = posi- 
tive staining in <25% of the sample; 2 - positive staining 
in 25%-50% of the sample; 3 = positive staining in >50% 



of the sample. Intensity score (0 to 3+) was multiplied by 
the density score (0-3) to give an overall score of 0-9 
1 1,51. this way, we were able to differentiate specimens 
that may have had focal areas of increased staining from 
those that had diffuse areas of increased staining [6|. The 
overall score for each specimen was then categorically 
assigned to one of the following groups: 0 score, negative 
expression; 1-2 scores, weak expression; 3-6 scores, mod- 
erate expression; 9 score, strong expression. 

Statistical anolYsis 

Intensity and density of PSCA protein and mRNA expres- 
sion in BPH, PIN and Pea tissues were compared using the 
Chi-square and Student's i-test. Univariate associations 
between PSCA expression and Gleason score, clinical 
stage and progression to androgen-independence were 
calculated using Fisher's Exact Test. For all analyses, p < 
0.05 was considered statistically significant. 

Results 

PSCA expression in BPH 

In general, PSCA protein and mRNA were expressed 
weakly in individual samples of BPH. Some areas of 
prostate expressed weak levels (composite score 1-2), 
whereas other areas were completely negative (composite 
score 0). Four cases (20%) of BPH had moderate expres- 
sion of PSCA protein and mRNA (composite score 4-6) 
by IHC and ISH. In 2/20 (10%) BPH specimens, PSCA 
mRNA expression was moderate (composite score 3-6), 
but PSCA protein expression was weak (composite score 
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2) in one and negative (composite score 0) in the other. 
PSCA expression was localized to the basal and secretory 
epithelial cells, and prostatic stroma was almost negative 
staining for PSCA protein and mRNA in all cases 
examined. 

PSCA expression In PIN 

In this study, we detected weak or negative expression of 
PSCA protein and mRNA (^2 scores) in 7 of 9 (77.8%) 
low grade PIM and in 2 of 1 1 (18.2%) HGPIN, and mod- 
erate expression (3-6 scores) in the rest 2 low grade PIN 
and 5 of 1 1 (45.5%) HGPIN. One HGPIN with moderate 
PSCA mRNA expression (6 score) was found weak stain- 
ing for PSCA protein (2 score) by IHC. Strong PSCA pro- 
tein and mRNA expression (9 score) were deteaed in the 
remaining 3 of 11 (27.3%) HGPIN. There was a statisti- 
cally significant difference of PSCA protein and mRNA 
expression levels observed between HGPIN and BPH (p < 
0.05), but no statistical difference reached between low 
grade PIN and BPH (p > 0.05). 

PSCA expression in Pea 

In order to determine if PSCA protein and mRNA can be 
detected in prostate cancers and if PSCA expression levels 
are increased in malignant compared with benign glands. 
Forty-eight paraffm-embedded Pea specimens were ana- 
lysed by IHC and ISH. It was shown that 19 of 48 (39.6%) 
Pea samples stained very strongly for PSCA protein and 
mRNA with a score of 9 and another 21 (43.8%) speci- 
mens displayed moderate staining with scores of 4-6 (Fig- 
ure 1). In addition, 4 specimens with moderate to strong 
PSCA mRNA expression (scores of 4-9) had weak protein 
staining (a score of 2) by IHC analyses. Overall Pea 
expressed a significantly higher level of PSCA protein and 
mRNA than any other specimen category in this study (p 
< 0.05, compared with BPH and PIN respectively). The 
result demonstrates that PSCA protein and mRNA are 
overexpressed by a majority of human Pea. 

Correlation of PSCA expression with Gieason score in Pea 
Using the semi-quantitative scoring method as described 
in Materials and Methods, we compared the expression 
level of PSCA protein and mRNA with Gleason grade of 
Pea, as shown in Table 1. Prostate adenocarcinomas were 
graded by Gleason score as 2-4 scores = well-differentia- 
lion, 5-7 scores = moderate-differentiation and 8-10 
scores = poor-differentiation |7|. Seventy-two percent of 
Gleason scores 8-10 prostate cancers had very strong 
staining of PSCA compared to 21% with Gleason scores 
5-7 and 17% with 2-4 respectively, demonstrating that 
poorly differentiated Pea had significantly stronger 
expression of PSCA protein and mRNA than moderately 
and well differentiated tumors (p < 0.05). As depicted in 
Figure 1, IHC and ISH analyses showed that PSCA protein 
and mRNA expression in several cases of poorly differen- 



tiated Pea were particularly prominent, with more intense 

and uniform staining. The results indicate that PSCA 
expression increases significantly with higher tumor grade 
in human Pea. 

Correlation of PSCA expression with clinicat stage in Pea 
With regards to PSCA expression in every stage of Pea, we 
showed the results in Tabic 2. Seventy-five percent of 
locally advanced and node positive cancers (i.e. C-D 
stages) expressed statistically high levels of PSCA versus 
32.5% that were organ confined (i.e. A-B stages) (p < 
0.05). The data demonstrate that PSCA expression 
increases significantly with advanced tumor stage in 
human Pea. 

Correlation of PSCA expression with androgen- 
independent progression of Pea 

All 9 Specimens of androgen-independent prostate can- 
cers stained positive for PSCA protein and mRNA. Eight 
specimens were obtained from patients managed prior to 
androgen ablation therapy. Seven of eight (87.5%) of 
these androgen-independent prostate cancers were in the 
strongest staining category (score = 9), compared with 
three out of eight (37.5%) of patients with androgen-, 
dependent cancers (p < 0.05). The results demonstrate 
that PSCA expression increases significantly with progres- 
sion to androgen-independence of human Pea. 

It is evident from the results above that within a majority 
of human prostate cancers the level of PSCA protein and 
mRNA expression correlates significantly with increasing 
grade, worsening stage and progression to androgen-inde- 
pendence. 

Correiot/on of PSCA intmunostaining and mRNA in situ 
hybridization 

In all 88 specimens surveyed herein, we compared the 
results of PSCA IHC staining with mRNA ISH analysis. 
Positive staining areas and its intensity and density scores 
evaluated by IHC were identical to those seen by ISH in 79 
of 88 (89.8%) specimens (18/20 BPH, 19/20 PIN and 42/ 
48 Pea respectively). Importantly, 27/27 samples with 
PSCA mRNA composite scores of 0-2, 32/36 samples 
with scores of 3-6 and 22/24 samples with a score of 9 
also had PSCA protein expression scores of 0-2, 3-6 and 
9 respectively. However, in 5 samples with PSCA mRNA 
overall scores of 3-6 and in 2 with scores of 9 there were 
less or negative PSCA protein expression (i.e. scores of 0- 
4), suggesting that this may reflect postiranscriptional 
modification of PSCA or that the epitopes recognized by 
PSCA mAb may be obscured in some cancers. The data 
demonstrate that the results of PSCA immunosiaining 
were consistent with those of mRNA ISH analysis, show- 
ing a high degree of correlation between PSCA protein 
and mRNA expression. 
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Figure t 

Representatives of PSCA IHC and ISH staining in Pea (A. IHC staining, B. ISH staining, x200 magnification). A,, B,: negative con- 
trol of IHC and ISH. PBS replacing the primary antibody (A,) and hybridization with a sense PSCA probe (B,) showed no back- 
ground staining. A^. B^: a moderately differentiated Pea (Gleason score = 3+3 = 6) with moderate staining (composite score = 
6) in all malignant cells; Aj: IHC shows not only cell surface but also apparent cytoplasmic staining of PSCA protein. Aj. B3: a 
poorly differentiated Pea (Gleason score = 4+4 = 8) with very strong staining (composite score = 9) In all malignant cells. 
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Discussion 

PSCA is homologous to a group of cell surface proteins 
that mark the earliest phase of hematopoietic develop- 
naent. PSCA mRNA expression is prostate-specific in nor- 
mal mate tissues and is highly up-regulated in both 
androgen-dependeni and-independent Pea xenografts 
{IAPC-4 tumors). We hypothesize that PSCA may play a 
role in Pea tumorigenesis and progression, and may serve 
as a target for Pea diagnosis and treatment. In this study, 
IHC and ISH showed that in general there were weak or 
absent PSCA protein and mRNA expression in BPH and 
low grade PIN tissues. However, PSCA protein and mRNA 
arc widely expressed in HGPIN, the putative precursor of 
invasive Pea, suggesting diat up-regulation of PSCA is an 
early event in prostate carcinogenesis. Recently, Reiter RE 
et al [ 1 ], using ISH analysis, reported that 97 of 1 1 8 (82%) 
HGPIN specimens stained suongly positive for PSCA 
mRNA. A very similar finding was seen on mouse PSCA 
(mPSCA) expression in mouse HGPIN tissues by Tran C. 
P ci a! |8]. These data suggest that PSCA may be a new 
marker associated with transformation of prostate cells 
and tumorigenesis. 

We observed that PSCA protein and mRNA are highly 
expressed in a large percentage of human prostate cancers, 
including advanced, poorly differentiated, androgen- 
independent and metastatic cases. Fluorescence-activated 
celt sorting and confocal/ immunofluorescent studies 
demonstrated cell surface expression of PSCA protein in 
Pea cells [9). Our IHC expression analysis of PSCA shows 
not only cell surface but also apparent cytoplasmic stain- 
ing of PSCA protein in Pea specimens (Figure 1 ). One pos- 
sible explanation for this is that anti-PSCA antibody can 
recognize PSCA peptide precursors that reside in the cyto- 
plasm. Also, it is possible that the positive staining that 
appears in the cytoplasm is actually from the overlying 
cell membrane (5). These data seem to indicate that PSCA 
is a novel cell surface marker for human Pea. 

Our results show that elevated level of PSCA expression 
correlates with high grade (i.e. poor differentiation), 
increased tumor stage and progression to androgen-inde- 
pendenee of Pea. These fmdings support the original IHC 
analyses by Gu Zet al (9], who reported that PSCA protein 
expressed in 94% of primary Pea and the intensity of 
PSCA protein expression increased with tumor grade, 
stage and progression to androgen- independence. Our 
results also collaborate the recent work of Han KR ei al 
1 10], in whicli the significant association between high 
PSCA expression and adverse prognostic features such as 
high Cleason score, seminal vesicle invasion and capsular 
involvement in Pea was found. Ii is suggested Uiai PSCA 
overexpression may be an adverse predictor for recur- 
rence, clinical progression or survival of Pea. Hara H el al 
111] used RT-PCR detection of PSA, PSMA and PSCA in 1 



ml of peripheral blood to evaluate Pea patients with poor 
prognosis. The results showed that among 58 PCa 
patients, each PGR indicated the prognostic value in the 
hierarchy of PSCA>PSA>PSMA RT-PCR. and extraprostatic 
cases with positive PSCA PGR indicated lower disease-pro- 
gression-free survival than those with negative PSCA PGR, 
demonsuating that PSCA can be used as a prognostic fac- 
tor. Dubey P ct al 1 1 2) reported that elevated numbers of 
PSCA + cells correlate positively with the onset and devel- 
opment of prostate carcinoma over a long time span in 
the prostates of the TRAMP and PTEN +/- models com- 
pared with its normal prostates. Taken together with our 
present fmdings, in which PSCA is overexpressed from 
HGPIN to almost frank carcinoma, it is reasonable and 
. possible to use increased PSCA expression level or 
increased numbers of PSCA-positive cells in the prostate 
samples as a prognostic marker to predict the potential 
onset of this cancer. These data raise the possibility that 
PSCA may have diagnostic utility or clinical prognostic 
value in human Pea. 

The cause of PSCA overexpression in Pea is not known. 
One possible mechanism is that it may result from PSCA 
gene amplification. In humans, PSCA is located on chro- 
mosome 8q24.2 which is often amplified in meta- 
static and recurrent Pea and considered to indicate a poor 
prognosis |13-15j. Interestingly, PSCA is in close proxim- 
ity to the c-myc oncogene, which is amplified in >20% of 
recurrent and metastatic prostate cancers |16,17]. Reiter 
RE ei al 1 18| reported that PSCA and MYC gene copy num- 
bers were co-amplified in 25% of tumors (five out of 
twenty), demonstrating that PSCA overexpression is asso- 
ciated with PSCA and MYC coamplification in Pea. Gu Z 
ei al 19) recently reporleted that in 102 specimens availa- 
ble to compare the results of PSCA immunosiaining with 
their previous mRNA ISH analysis, 92 (90.2%) had iden- 
tically positive areas of PSCA protein and mRNA expres- 
sion. Taken together with our findings, in which we 
detected moderate to strong expression of PSCA protein 
and mRNA in 34 of 40 (85%) Pea specimens examined 
simultaneously by IHC and ISH analyses, it is demon- 
strated that PSCA protein and mRNA overexpressed in 
human Pea, and that the increased protein level of PSCA 
was resulted from the upregulated transcription of its 
mRNA. 

At present, the regulation mechanisms of human PSCA 
expression and its biological function are yet to be eluci- 
dated. PSCA expression may be regulated by multiple fac- 
tors [ 1 8). Waiabe T et al 1 1 9) reported that transcriptional 
control is a major component regulating PSCA expression 
levels. In addition, induction of PSCA expression may be 
regulated or mediated through cell-cell contact and pro- 
tein kinase C (PKC) |20]. Homologues of PSCA have 
diverse activities, and have themselves been involved in 
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carcinogenesis. Signailing through SCA-2 has been dem- 
onstrated to prevent apoptosis in immature thymocytes 
[21]. Thy-1 is involved in T cell activation and iransducts 
signals through src-like tyrosine kinases |22]. Ly-6 genes 
have been implicated both in tumorigenesis and in cell- 
cell adhesion [23-251. Cell-cell or cell-matrix interaction is 
critical for local tumor growth and spread to distal sites. 
From its restricted expression in basal cells of normal 
prostate and its homology to 5CA-2, PSCA may play a role 
in stem/progenitor cell function, such as self-renewal (i,e, 
anti-apoptosis) and/or proliferation jl]. Taken together 
with the results in the present study, we speculate that 
PSCA may play a role in tumorigenesis and clinical pro- 
gression of Pea through affecting cell transformation and 
proliferation. From our results, it is also suggested that 
PSCA as a new cell surface antigen may have a number of 
potential uses in the diagnosis, therapy and clinical prog- 
nosis of human Pea. PSCA overexpression in prostate 
biopsies could be used to identify patients at high risk to 
develop recurrent or metastatic disease, and to discrimi- 
nate cancers from normal glands in prostatectomy sam- 
ples. Similarly, the detection of PSCA-overexpressing cells 
in bone marrow or peripheral blood may identify and pre- 
dict metastatic progression better than currerit assays, 
which identify only PSA-positive or PSMA-positive pros- 
tate cells. 

In summary, we have shown in this study that PSCA pro- 
tein and mRNA are maintained in expression from 
HGPIN through all stages of Pea in a majority of cases, 
which may be associated with prostate carcinogenesis and 
correlate positively with high tumor grade (poor cell dif- 
ferentiation), advanced stage and androgen-independent 
progression. PSCA protein overexpression is due to the 
upregulaiion of its mRNA transcription. The results sug- 
gest that PSCA may be a promising molecular marker for 
the clinical prognosis of human Pea and a valuable target 
for diagnosis and therapy of this tumor. 
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Abstfod 

IhutsLiUon biMation Is regulated In response to 
mitrtent sntaHsbmt^ and nytogmUo stimulation and Is 
coupled vWth ceO cycle progression and cefl grawttu 

Several aRmtions In tianslstlQnal control occur In 
cancer. Variant mRHA sequences can after the 

translatlomri effidency of Incfivldual mRN4 moleculee» 
wWch in turn play a role In cancer biology. Changes in 

tlie expression CNT avaOabOity of ccttnponents of tfie 
transfational madtlnery and In the acUvatlon off 
translation through slgn^ transduction pathw^ can 
lead to more global changes, such as an Increase bi 
the overA rale of prolefai synthesis and tmislation^ 
activation of the mRNA molecules Involved In cell 
growtf) and proWmtkm. We review tfie basic 
prindpies of Iransiational control Ihe afterations 
encountered In panoer, and selected therapies 
targeting translation Initiation to help elucidate new 
therapeutic avenues^ 

introduction 

The ftindamental principle of molecular therapeutics in can- 
cer is to exploit tiie differences In gene expression between 
cancer cells and nomiaJ cells. With the advent of ci:^ aiTBy 
technology, most efforts have concentrated on Identt^ng 
differences In gene expression at the level of mfWA, which 
can be attributable elttier to DMA ampllflcatlon or to differ- 
ences In transcriptton. Gene expresston Is quite complicated^ 
however, and Is also regulated at the level of mRNA stability, 
mRIMA translation, and protein stal>liity. 

Ihe power of translatfonal regulation has been tjest recog- 
nized among developmentaJ biologists, because transcdption 
does not occur In early embryogenesls In eukaiyotes. For ex- 
ample, in Xenopus, tiie period of transcriptional qutesoenoe 
continues iviffl the embryo reaches ml<tt>lastula transition, the 
4000-cell staga Therefore, atl necessary mRNA molecules are 
transcribed during oogenesis and stockpiled In atransIaHonally 
inactive, masked form. The mRNA are translatfonafly activated 
at appropriate times during oocyte maturation, fertilization, and 



caHy ennixyogene* and ttius, are under strict tianslatkMial 
contret. 

Translation has m established role In c^l ^owtit Ba^~ 
cally, an Increase hi jHoteIn synttiesis occira as a consa- 
quence of n^togenesls. Until recentiy, howev^, IKtie was 

known about ti)e aherations in mRNA translation in cancer, 
and mudi la yet to be jtfiscovered about their role bi the 
dervek>pment and progr^on of cancer* Here we review the 
basic principles of translational contrpt the alterations en- 
oounteied in cancer, and selected theraptos targeting tr^fftsla*- 
tion InltlaHdn to elUddata potentferi new tiierapeutk^ aven^ 

Ba^ Princlplea of Translational Control 

Mechanism of Translation initiation 

TrvisleHon InlUatkH) Is tiie main step in transl^^ 

Translatkw Inlttatton b a cornpJex prcx»^ In whWi tile InWa^ 

tflNA and tiie 40S and 60S rfiwsomal subunlts are recruited to 
tiie 5' end of a ml=^ rnolecute £BfKl assemt)ted by eukaryi^ 
transtatfon inhlatton factors Into an SOS ribosoma at the ^art 

codon of the rnTOOA (Ftg. 1). The 5' end of eukaryotic nil^NA Is 
capped, /.a. contains the cap structure m^GpppN (7-nietiiyl- 

guanosine-triphospho-S'-fibbnucteoside}. MoA translation In 
eukaryotes occurs In a cap<lependem fashion, y.a, tits cap Is 
specifically recognised by the elF4]^^ whk^i b&Kis tile 5' cap. 
The elF4F translation Inilialion complex Is tiien formed by tiie 
assemljiy of elF4E, tiie RNA helicase'elF4A, and dF4G, a 
scafibldlng protein that mediates tiie binding of tiie 40S rft^ 
sonrial subunit to tiie mRNA nK>lecule tiirou^ Interaction wftii 
ttie eiF3 prot^ pres«it on tiie 40S nTx>soma elF4A and elF4B 

parikipate in melting the secondary stnjchire of tiie 5' UTR of 
tiie mRNA. The 438 IrytlalkHi compiax (40S^BlF2/M^-ti=ttW 
GTP compte)0 scans tiie mRNA in a 5'-*3' dlfBclk)n until it 

encounters an AUG st^ codon. This start codon Is tiien base- 
paired to tiie anBcodon of inlti^or tflNA, fbnnlng tiie 48S Inlti- 

allon complex. The Inltlatton factors arc ttien displaced from tiie 
488 complex, and tiie 60S ribosome Joins to forni tiie 80$ 
ritx>some. 

Unlike most eukaryolic translation, transfatkm Initiation of 
certain mRNAs, such as tfie plcomavinis RNA, Is cap inde- 
pendent and occurs by fritemal ribosome entry. TWs mecha- 
nism does not require eiF4E. Btiier ttie 43S complex can bind 
»» InltiaOon codon dfcecHy tiwough Interaction tiie IRES In 
ttie 5' UTR such as in ttie encephalomyocarditis vfrus, or it can 
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untranstated regkx*; IRESw Imemai ifl^^ eukaryotfc 
Wtlatoiifactor4E-t)^^ 

mammaBan target of rapamytin; ATM, atsDda tetangiteqtwla mutate* 

phcwi^iatWyfinositol Wdnase: PTHN. phosph 

teted ftw chromosome 10; PP2A. proi^ phcwphatase 2A: TG»^^3, tw^ 
forming growth factor-^3; PAP. poty(A) polymerase; EPA. eioosopentaenob 
add; nxfe-7, melanoma efifferenteUon-assodated gene 7. 
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S: 'lJ'';^'atlon initiation In eukaryotes. The are hi«<*«hr«. 

ot!L SS^^S^ ^"^^^^ ^ interaction of powSbSKmS 

eonsl3MngofelF2 ^PmaaJ^S^J^^Z^' "^J** complex 
released, and the largo rOxwomal subunft to fecnA^ 

seaming or transfer, as Is the case vrfth the poliovims (1). 

ftegulatton of Translation IniOaUon 
TianslaBon Ntlatfon tean be regulated by alteraUo»» In the 
«|pr^on or phosphorylation status oi the various fectors 
»^h^Key compoiwnts In translational regulation that 
may prwIdB potential therapeuHc targets follow 

It b t!!ri«t?l''T ^ ♦«««teUon regulation. 

« fe the east abundant of the Initiation factors and Is con- 

JS!^ «f rate^lmltlng componerrt for Inlttetton of cap- 
ST^- ^"'^^ ateo be involved In mRfK 
S^n" P««8S8lng. and mRNA nucteocytopte- 

toai«o1^onaltevel m response to senm or growth factors 
^mFSJi^3?r"°" "^^ ^ preferential translation 
!• ^S^J^^'""^ secondary structure m their 

5 inn that are normally discriminated against by the tians- 



Wtonal ma*lneiy and thus are heBictently tiansla^ 
'««oniple8of iWs, oversxpresslcn of elF4E piomotss InZ 
o^sed IransMoti of vascular endothelial ^owlti fector 

«wlher mechanbin of control Is the regulatton of el^^ 

tea^jed^ tte irttogen-actlv^ pathway activating 
«*wsellular trtgnahelated kinases and the stressHicUvated 

*!^e*^"^*' ^ mltogen^ctlvgted pratebi ki- 
^ 00-13>^everal mitogens, such as senimVplatetet- 
*J«d growlh factor, epWemal growth factor. Insulin, 
angiotensin II, sic kinase overexpresslon. and teW- 
^ Phoaphoiylatldn (14). The phos- 

Ph^tetton ,«us of elF4E Is usually con^ wift the 
tnm^onal rate and growth status of the ceil; however; 
elF4E phosphorylation has also been observed h lespons^ 
to some ceHuter stresses when translattonal rates 

i^^^ 5 ^ ^ ^ *° understand 

the eHtels of elF4E phosphorylation on elF4E activity. 

T^5 •J^^'sm of regulatton Is the atteratfon of elF4E 

Sil^^^^^ of elF4Etothe elF4E*Indlng p,T>- 

SJ^^t^'.!^^"^^ ^^BP8 compete with 

elMQ for a blndbig site In eIRE The binding of elF4E to the 

^w^^L'?°^"»^ Hypophosphorylate^ 
•BP1 binds to eIRE. whereas 4&BP1 hyperphos^oiyiatton 
decreases this binding. . insulin. anglotenshTSSS 
growth factor, platelet-derived growth facfor, h^atocyte 

'^OmMhtattor. Insulln-llke growth fectors 
I OKI n. mterteuMn 3. ^ranutocyte-macrophage colony-«tbiv- 

ute*^ factor + steel factor, gastrin, and the adenovhis have 
all been reported to induce phosphorylation of 4MP1 and 
to decTBtee the ability of 4E-BP1 to bind elF4E (15 lei 
Oow,^ deprivation of nutrients or growth factors result 
in 4E-BP1 dephosphoryteMon. an bicrease m elF4E blndbia 
and a decrease In capKlependenttianslaHoa 

P»w«Phorylatlon of rib08omal40S protein 
*° ^ ^ Important rote In tiansialionai 
regutetton. S6K -I- mouse embiyonto cells prelifeiale mor« 
slowrty than do parental cells, demonstrating that S6K has a 
poslthre Influence on cet prollferSition (1 7). S6K regulates the 

°^ """^ possessing a 5' temUnal 
o^ynmkJlnetract(5'TOP)tt)undatthe5'UmofribosomaI 

PhosphorytaHon of S6K Is regu- 
tetedln^ based on the avBlIabinty of nutrients (18, 19)andls 

growth factor and InsuDiMike gncjwth faotor I {20) 
olFafl PhosphorytaHon. The binding of the Initiator tRNA 

»>y translatton InWa- 
tlon factor elF2. Phosphorytatton of the o-subunit of elF2 

^SZf JST^^"* elF2^GTP/Met-tRNA complex and 
mhibite global protein syrrthesis (21. 22). elF2a is phospho- 

'^f! °' condlttons. such as viral Infeobon 

^ deprivation, and apoptosis (22): 
?J!^?^^^ by heme^egulated Inhibitor, nutrtan- 

^ ^ IFN-lnduced. double, 
stranded FlNA-activated protein kinase (PKR; Ref. 23). 




The mTOR Signaling Pattmr^. The macrolldo ai 

p;^ has been tto auk^ect of Intend ettidy because It in- 
Ijttte signal tiansAidfon pam^ 
«on. 71© rapamydn^nsftlve component of these pathways 
k in7X)R (too called iWP or RAm). mTOR to tt^ 
mallan homotogue of theyeast TOR protehs thai reguJate Q, 
ProgiBSslon and translation In response to nutrient avallabU- 
Ity (24^ mTOR Is a serine-thieonino idnase that mbdidates 
toislatlonlnmalton by ahering the phosphor 
4&BP1 and S6K (Fig. 2; Ref. 25). 

4MP1 Is phosphoiyfated on multipleresldue& mTOR p^ 
phwylates the Thr^ and Thr-46 iBsWues of 4EW 

however, pho^horylaBon at these sites b not assocfe^ 
wWi a loss of elF4E l^lng. PhosphayteUon of 7lr-«7 and 
Ifjir^ b required for 

' pno^ihorylaftlon events appears to be needed to frihtolt the 
bindlngof 4E-«P1 toelF4E(25).-nioproductofthe>l7Wgene.. 
p3a/MSK1 pathway, and protein Wnase Oct also pl^ a rate In 
4SBP1 phosphorylation ^7-29). 

S6K and 4E-BP1 are also regulated, In part, by PI3K and Its 
downstream pretefn kinase Akt PTEN Is a phosphatase that 
negatfveV regulates PI3K signaling. FlBi nuO cells hove 
constltutlvely active of Akt. with Increased S6K activity and 
SB phosphorylatton (30). S6K activfty Is Inhibited both by 
PI3K IriWbltors wortmannin and LY294002 and by mTOR 
Inhibto rapamycln 24). Akl phosphorylates Ser-2448 In 
mTOR in wtt|o, and this site Is pbosphorylated upon Akt 
acHvatfon in vivo (31-33). Thus. mTOR Is regulated by the 
PI3K/Akt pathway: however, this does riot appear to be the 
only mode of regulation of mTOR activity. Whether the PI3K 
pathway also regulates S6K and 4E-BP1 phosphorylatbn 
.Independent ot mTOR Is controverela). 

Interestingly, mTOR autophosphoryfatton Is bk>cked by wort- 
mannin but not by rapamycln (34). This seonfeig Inconsistency 
sugg^ thai mTOR-responslve regulatfon of 4E-BP1 and S6K 
activity occurs through a mechanism other than fritrinsto mTOR 
kinase acth%. An aJtemale pathway for 4&BP1 and S6K phos- 
phorylaBon by mTOR acth% is by the lnhlbltk)n of a phospha- 
tasa Treatment with calyculin A, an Inhibitor of phosphatases 1 
and 2A, reduces rapamydn-induced dephosphorylatfon of 4E- 
BP1 and S6K by rapamycln (35). PP2A Interacts with fuIHenglh 
S6K but not with a S6K mutant that Is resistant to dephospho- 
rylatfon nesultlng fnDm rapamydn. mTOR phosphorylales PP2A 
In vitro; however, how this process alters PP2A actlwty Is not 

known. These results are conslstert wrmi the model that phos- 
phorylation of a phosphatase by mTOR prevents dephospho- 
rylatfon of 4E-BP1 and S6K. and conversely, that nutHent dep- 
rtvatton and rapamycln bkx* inWbftton of the phosphatase by 
mTOR 

PoVadenylatlon. The poWi tail In eukaryotic mRNA Is 
Important In enhancing translation Inlttatfon and mRNA sta- 
bility. Polyadenylatlon plays a key role In regulating gene 
expression during oogenesis and earty embryogenesls. 
Some mRNA that are translatfonally Inactive In the oocyte are 
polyadenylated concomttanUy with transtatlonal activation In 
oocyte matuTHtloa whereas other mRNAs that are transla- 
tlonally active during oogenesis are deadenytaled and Uans- 
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lationally silenced (36-^). Thus, control of poIy(A) tafl syn« 
th^ Is an onportant regi^iy step In gene expression. 

T)»e 5' cap and poly{^ tall are thought to function synerote- 
tlcally to regulate mRNA translational effidency (39 AOL 

RNA Packaging. Most RNA-blndIng proteins m assem- 
bled on a transcript at the time of transcription, thus deter- 
mining the translatlonal fate of the transcript (41). A Wohly 
conserved family of Y-box proteins is found m cytopSamIc 
messenger rlbonudeoproteln particles, where the proteins 
aro thought to play aroTe In restricting the recniltment of 
mRNA to the tnanslallonal machinery (4V43). The ma|or 
mRNA-assoclated protein. , destabilizes the Interaction 
of e1F4Eandthe5' mRNA cap /nv«ro, andoverexpresslonof 
YB-1 results In translatlona! repression In vivo (44). Thus, 
alterations In RNA packaging can also play an Important role 
In translatlona! regulation. 

Translation Alterations Encountered in Cancer 

Three main alterations at the translational level occur In cancer 
variations In mRNA sequences that Increase or decrease trans^ 
fatlonal efficiency, changes In the expression or avallabDIfy of 
components of the tianslallonal machinery, and acHvaHon of 
translation through abenantly activated slonal transducHon 
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Miay a roio in carcinogenesis. The sec^ 

ond and third alterations can lead to more global changes, such 

as an Increase In the overaB lalB of protein synthesis, and the 
translational activation of several mRNA species. 

VaiiBthns In mRHA Sequence 
Variations In mRNA sequence affect the translational effl- 
clency of the transcript A brief description of these variations 
and examples of each mechanism follow. 

Mutations. Mutations in the mRNA sequence, especlalty 
In the 5' UTR. can alter Its translational efficiency, as seen In 
the following examples. 
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<HiiKC b repressed, whereas In sevoal BuiWtt tymphomas 
thai have delettoiis of the mRNA 5' Ura tra^ 

5' im of c-fi^fc contains an IFES, aixl mus 

tfon can 1)0 rnHiated by a cap^ndependenl as weO as a 

capKfepandem mechanism (46, 47). In patients wHh 
myeloma, a C-*T mutatldn In the o-myc IRES was WertWed 

W aiKlfound to cause an enhanced Wtlallw of t^ 
via Internal ribospmal entry (48). 

BACAf. Aeomatic pofrit mutation (117 Q^q in position 
■-3 wRh respect to the start codon of the B/?G^7 gene was 

Identified In a hIghV aggresslvo sporadic breast cancer C5Q). 
Chlmertc constiuds oonsisttng of the virtld-type or mutated 
BRGA1 5' urn and a downstream ludferase reporter dem- 

w^ed a decrease In the tianslational effi^^ 
uTR mutafion* • 

P)«=«M*^pemfefifia^ Some Inherited 

melanoma kindreds have a Q->T transverslon at liase -34 
of cycSn<lependent Wnase lnWbltor-2A. which encodes a 
cycllnKiependent kinase 4/cycBn<lependenl kinase 6 kfriase 
Inhlfcrftor Important In checkpoint regulaUon {51). This 
mutatlon,slves rise to a novel AUG translation lnftlatk>n 
codon. creating an upstream open reading frame that com- 
petes for scanning ribpsomes and decreases translalk>n 
nom the wnd-type AUG. - 

AHemate SpHdng and Attemale Transcription Start 
Sttes. AReratfons In spBdng and aRemate transcriptk>n sites 

. can lead to variations In 6' um sequence, length, and sd^ 
ary stnidur^ ultimately impacting translafional efficiency, 

i4m ne ATM gene has four noncocfing exons fri its 5' 
UTR that undergo extensive alternative spiking pg. TTie 
contents of 12 different 5' UTOs that show considerable 

diversity In length and sequence have been Wentilied. These 
divergent 6' leader sequences play an Important role in the 

translational regulatfon of the ATM genoL 

mcfta In a subset of tumors, overexpresston of the onco- 
pi^ mdm2 results In enhanced translaHon of the mdm2 
mRNA. Use of different promoters leads to tm md/n2 tian- 

^ts that drffer only in their 5' leaders (53). The tonger 5' 
UTO contato two upstream open reading frames, and this 
mRNA is loaded with ribosomes Inefficlentty compared vwth 
the short 5' UTR. 

BnCAI. In a normal mammary gland. BRCAl mRNA Is 
expressed with a shorter leader sequence whereas 
lnsporadlcbreastcancertlssue.BTCy\f mRNA Is expressed 
wim a tonger leader sequence (5' UTRb); the translatlonal 
efficiency of transcripts containing 5' UTRb is 1 0 times lower 
than that of hanscripts containing 5' UTT^a (54). 

TGF^fiS. TGF-pa mRNA includes a 1.1-kb 5' UTO. whteh - 
exerts an Inhibitory effect on translatioa Many human breast 
<»i^er cea nnes contain a novel Tt3F-p3 transcript with a 5 ' 
urn that Is 870 nucteotkJes shorter and has a 7-fbId greater 
translatf one! efficiency than the nonnal VOF-fiS mRNA 

Altemate Polyaderiylatlon Sites. Multlpte polyadenyl^ 
atlon signals leading to the generatton of several transcripts 
with differing 3' l/fR have been described for several mRNA 
specif, such as the RBT pioto-oncogene (56), ATM gene 
P2h tissue inhibitor of metalloprotelnases-3 (57). RHOA 



protoK)ncogene arKi calmodulM ps). Although the 
effect of these altemate y UTRs on translation Is not y&t 
ttwy may be Important m RNA-proteIn Interacttons 

that affect translatlbnal recfititnient The iole of these alter 
attons In cancer dewetopment and prooressloii is unkn»uiin 



MemOons In iho dmqiment^ of 
Tmnstothn Machbmy 

Alterattona In the compon^ of translatton machinery can 
take many forms. 

Oweiejcpressslon erf elF4E. Overexpresslon of elF4E 

wuses maBgnam transformation In rodent cells (BQ and ttio 
dwegulatkm of HeU cett gnwth (61). Pokmovsl^ etaL (62!> 

fbimd that elF4E overexpresslon substltules Ibr serum ana 
Individual growth factors in present vlabDity of fibioblasts. 

vifhfchsuggeslsthalelREcan mediate both proliferative anci 
survival signaling. 

Elevated levels of elF4E mRNA have been found In a broacj 
spectniro of transftmned ceO Ibies (63). elF4E levels 
^evated In all diittal cardnoma In situ specimens and inva- 
sive ductal carcinomas, compared wUh benign breast spec^ 
imens evaluated with Western blot analysis (64, 65). Prelim^ 

mary studies suggest that this overexpresslon is attributablo 
to gene ampDficafion (66). 

There are accuraiating data su£»esllng that eiF4E oveiw- 
pressioncan be vaiuaweas a prognostic maricer. elF4E over- 
expression vi«s found bi a retrD^)ecW^ 

poor fHognosIs In stages I to IB breast carcinoma (P7). Verifier 

*>n cf the prognostic value of elF4E fci breast cancer b now 
under w^ m a prospective trial (67). Hov^rever, In a different 
study, eIRE expression was conelated wUh the aggressive 
behavior of non4todgkln's lymphomas (68), in a prospective 
analysfe of patfehts with head and neck cancer, Novated lev^ 
of eF4E In histologically tumor-^ s«^cal margins predicted 
a signlficanlly fafKreased risk of localneglonal recunence m 
These results aQ suggest that eIF4E oveiaxpresslon can be 

used to select patients vwho migW benemfrt)m more aggressive 
«ystenilc thenapy. Fwthemwe. the head an^ 

suggest that elF4E overexpresslon Is a field defect and can be 
used to guJde k)cal therapy. 

AHerations in Other Initiation Factors. Alterations In a 
number of other initiation factors have been associated with 
cancer. Overproduction of eIF4G, similar to elF4E, leads to 
malignant transfonnatlon (n vhro (69). eiF.2a Is found In 
Increased levels in bronchioToalveolar carcinomas of the lung 
^). Initiation factor elF-4A1 Is overexpfessed In melanoma 
(70) and hepatocellular carcinoma (71). The p40 subunit of 
translation initiation factor 3 is amplified and overexpressed 
In breast and prostate cancer (72). and the elF3-p1 10 subunit 
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opment and progres^on of cancer, if any, is not known 

Overexpresslon of S6K. S6K Is amplified and hlghW 
overexpressed in the MCF7 breast cancer cell line, com' 
pared with normal mammary epithelium (74). In astudy by 
Bartund ef al, (74), SSK was ampilfled in 59 of 668 primary 
breast tumors, and a statistically significant association vras 
observed between amplification and poor prognosis. 
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Overexpresslon of PAP. PAP catalyzes 3' p6tm syn- 
thesla. PAP is werexpressed in human cancer cella com- 
pared with nmoBA and vitally tiansfbrmed (TS). PAP 
eruymatlb activity In t}reast 

PAPprct^l0veb(rQand, Inmammaiy tifinorcylos^ 
foimdtobeanindepmiemfactorfbrpcedlcl^ 

IJttle la laiowiv hoWer, about how PAP expression 01 
thoty affects thatranslational profile. 

Alt8iatlon8lnRNA4ilndingProlelii3. Even less Is Known 
a boU a ftemBana h PNfii packaging In cancer. Increased ex- 
pression and nudear locaBzatton of tite FU4A-b(nd^ protein 

YB-1 are IrKficatoTB of a poor prognosis for breast cancer 
non-snwB lung carK^ (76^ and owarian 
eva; this efliBCl may bo medteted at least h part a* the tetfBl ^ 
tenscriptJon, because YB-I increases chemoreslstarice by en- 
handng the tnanscripftion ol a muttkjrug leslsta^ 



AcUvathn of Signal Tramductkm PaUtways 
Activsftlon of signal transduction pathways t)y loss of tumor 
suppressor genes or cvarexpiession of certain lyr^^ 
can cwtlribute to the grovimi «rKi aggres^eness of 
In^xxtert^nrnrtant In human cancm Is the tumor si^}pres8or 

gene P7Bf\^ which leads to the activation of the PI3K^^ 

vi»y. ActivatlDn of Pi3K and Akt Induces the oncogenic trans- 
forrnaBon of cWcken embryo fibroblasts 

^Kiw constitiithfe pho^)h^ylatlon of S6K and of 4E^P] ^m^^ 



phoryiateiS6Kor4&BP1 
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activity but does not phos- 
t tnEffisfonri f%)robfasts. which 
le oncxigenlcay of PI3K and 



Several tyrosine Idnases such as platelet*derWed growth 
fector. Insultn-Uke growfii fector, HEI^eu. and epldemial 
growth factor receptor are overexpressed in cancer. Be- 
cause these kinases activate downstieam signal transduce 
lion pathways known to alter tFEuislatlon inltiattoa actlvaUon 
of translallon Is likely to conlrtbute to the growth and aggres* 
slveness of these tumors. Furthermore, the mRIMA for many 
of these kbtases themselves are under translattonal control. 
For example. HER2/neu mRNA Is translattonally controlled 
both Ijy a short upstream open reading frame that represses 
HER2/neu translatton in a cell type-Independent manner and 
by a distinct cell type-dependent mechanism that increases 
translatlonal efftelency (82). HER2^eu translation Is different 
In transfonmed and nomial cells. Thus, It Is possible that 
alterations at the translatlonal level can In part account for 
the discrepancy between HEf^2/neu gene an)pllfk:atk>n de- 
tected by Huoiesconce In sftu hybrldteatton and protein levels 
detected by immunohlstochemlcal assays. 

Translation Targets of Selected Cancer Therapy 

Components of the translation machinery and signal path- 
ways Involved In the activatton of translatton Initiation repre- 
sent good targets for cancer therapy. 

Targeting tho mTOR Signaling Pathway: Rapanw<N'n 
and Tumsiatin 

Rapamydn Inhibits the prollferatton of lymphocytes. It was 
initially developed as an immunosuppressive drug for organ 



transfdantatton. Rapamydn FKBP 12 (FK506-blndkig 
piolein. M, 12,000) bimia to mlX)R to inl^St its ft^^ 
Rapamycbi cmises a smaQ but s^Htoant reductton In the 

InitiatkmrBteof prot^^nthesls^. Itbk)ck3oeilgrowth in 
part bkxMng S6 phosphofylaUQn and e^ecUvely sui^ 

ptessbig the translaSon <tf mRNAs. such.as ribosomal 
protdns» and elongation factors fB3-8S). Rapamydn also 

bk>cks4&BP1 phb^)hoiyl&dfcm and Inhibits cap^lepe^ 
but not cap-Independent translatton (1 7, 86). 

The r^)amydn-6en&ltlve signd transduction pathway, acti- 
vated ctoing rriallgnanf transfbrmalkm and cancer piogri»^ 
is now bieing studied as a tagk Ibr cancer thersfiy ^ ProQ^ 

tate» breast, sfYiaB ceD lung, gaot^aston», melanoma, and T-cen 
iw^errte are among the cancer fines most sensSthre to the 
r^^wmycfri anatogue CCI-779 (VVye^yeist Research; Ref. 

In ihstbdonrq^oosaiooma 8ms, raparr^^ 
slatte or cytocktel, depending wi the p53 status of the c^ p53 
vvild-type oeSs treated with rapaniycln cnest In t^ 
and maintain thelr.vlabUity, whereas p53 mutant oeOs accumtK 

late InOt and undergo apoptosis (88. 89). In arscently reported 
study idng human prirriltlve neuroectodern^ tumor arui 

meduitobfastoma models, raparnydn exhblted more cytotox- 

k%tooonriblnatlonv>rfft cteplatinandca m ptothec^ 

slngto agertL fri vto, CCK>779 delayed ^owtti of xenografts b^ 

160% aflerl week of therapy and 240% afler2 weeks. Aslnglo 
htghKloee admlnSstratton caused a 37% decrease h tumor 

vdunria Qrowth MiOamon vfto 

dsplatin In comblnatton >Aith CCI-779 than vimh d^^ 

(9(9. Thus, preclinical studies suggest ^ rapamydn an»- 

tofi^ are usehjl as singto agents .and fan comblnatton with 

diemotherapy, 

Rapamydn analogues CCI-779 and RAD001 (Tslovartlo, 
Basel. Switzerland) are now In clinical trials. Because of the 
knovMfY effect of rapamydn on lyn^hocyte proliferatton, a 
potential problem wHh rapamydn analogues Is immunosu|>- 
presslon. However, although protonged Immunosuppresskni 
can result fifom rapamydn and CCh779 administered on 
continuous-dose schedules» the immunosuppressive eAeds 
of rapamydn analogues resolve In --24 h after therapy 
(91). The prindpal toxteitles of CCI-779 have Induded der- 
matotogical toxtelty, myelosuppresston. Infection, mucositta, 
diarrhea, reversible elevations In liver function tests, hyper- 
glycemia, hypokalemia, hypocalcemia, and depression (B7, 
92-^). Phase 11 trials of CCI-779 have been conducted iri 
advanced renal cell carcinoma and in stage !il/IV breast 
carolnoma patients who felled with prior chemotherapy. In 
the results reported in at)stract form, although there were no 
complete responses, partial responses w«e documented In 
Ijoth renal cell cardnoma and In breast cardnoma P4, 95). 
Thus, 001-779 has documented preliminary clinical activity In 
a prevtously treated, unseleded patient populalton- 

Active Investigation is under way into patient sdectton for 
mTOR inhltrftora. Several studies have found an enhar>ced 
efficacy of CCI-779 In PTEN-null tumore po, 96). Another 
study found that sU of eight breast cancer cetl lines were 
responsive to CCI-779, although onty two of these Hnes 
lacked PTEN (97) There was. however, a positive correlation 
Ijetween AW activation and CCI-779 sensitivity (97). This 
conrelation suggests that activation of the PI3K-Akt pathway. 
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riegardless of vrtwther It b alWbutabte to a PTEN 
to overexpresslon of receptor ty^ 
cer cell amenable to mTOR-dfrected then^sy. In cofrtiast 
taw levels c* the targrt of mTOa 4E-BP1, are cesa 
with rapanydn resistance; thus, a lower 46-BP1/elF4E ratio 
rnay fmlkt rapamycin leelstarice 9^ 
Aiiother iiMxIe of activity for reparny^ 
aw>eaiB to bo through V^hibllkm of anglogeriesls. 

fty riwy be both thn)ugh direct kihlbWcw of aiiib 
proWtollon as aiiesuft of mTOR WiibWo^ 
InMbltlon of transiatton of such proanglogenic fectora as 
vascular eiKiothelW ^ovirth factor In tu^ 

The an^ene^ inhibitor tumstatin, another anticancer 
*ug cunBnU/ under study, was also found recently to Inhibit 
translation In endothelial ccDs (101), Through a reqidstta lr>- 
teractlon with Integrin, tumstatin inhibits activation of the 
PJ3K/Akt pathway and mTOR In mdothellal cella and pre* 
vents dissociation of eIF4E from 4E-BP1, thereby friWbftlng 
cap-dependeril translation. TTiese finding suggest that en- 
dothelial cells are especteJIy sensitive to therapies fai 
the mTOR^tgnallng pathway. 
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inerrt also tectees the expiBsdon of angtogente 

and has been proposed as a potential adjuvant thoapy Ibr heact 
and neck cancer^ partlcuJariy virfien cteyated eIRE b 

niarglis. Smaa niotecite ihhibito 

-binding domain of elF4e are proapoploBc (HQ aid 

beb^ actively pursued. 



Bxptomng Sefectiire Dransiatkm for Owe TTierapy 

A dSfiBrenl theiBpeute approach that takes advanl^ 

enhanced cap-dependert translalton In cancer ceds Is the 
Of gene therapy vectora enomflhg suicide genes with W^iy 

stnjcturad 5' UlRThese ritfWA wouW thus bo at a 

dfeadvants^ In nonnal cefe and not transfala vveO, whereas In 
cancer ceas, they wwWtraffisIato more elfWently^ 

the inlroduction of the 5' Um of ftxoUast grovt^ fector^ 
the oodb^ sequence of he/pes sfrrpfeof v*us 

«Taso gene^ allows for setecth« tnwriatlon of 
**wfi|pe-f thymkHne kinase gene In breast cancer ceO Bnfta 
compared with nonnal manunary ceB lines and results In 
lectiVB eenslfenty to ganciclovir a 1 A 



Targetiif^ elf=2a: EPA, Chtrimsmle, 
dnd Rovonokis 

EPA Is an n-3 potyunsaturated fatty add found in the fish- 
based diets of populations having a low Incidence of cancer 
(102). BPA Irihlbtts the prolifemtlon of cancer cells (103), as 
wen as In animal models (1 04. 1 05), It blocks cell division by 
inhibmng translation mitialion (10^. EPA releases C&^* from 
Intracellular stores whHo Inhibiting their refWlng, thereby ac- 
tivating PKFt PKR, in turn phosphorylates and InhftMts e!F2a, 
resulting In the inhibition of protein synthesis at the level of 
translation Wtlatlon. SImflarly, clotrimazole, a potent antipro- 
liferative agent fri W&o and to Wwo, inhibits cefl growtti through 
depletion of Ca=** stores, activation of PKR. and phospho- 
rylation of elF2a (106). Consequently, clotrimazole preferen- 
tially decreases the expression of cycllna A, E, and D1 
resulting bi blockage of the oeli cyde In . 

mda-7 \& a novel tunrw suppressor gene being developed 
as a gene therapy agent. Adenoviral transfer of mda-7 (Ad- 
mdaT) hduces apoptosis In many cancer cells Including 
breast, colorectal, and lung cancer (107-109). Ad-mdaZ also 
Induces and activates PKR, which leads to phosphorylation 
of elF2a and Induction of apoptosis (1 10). 

Fiavonolds such as genlstein and quercetin suppress tu- 
mor ceH growth. All three nrwmmallan elF2a kinases, PKR, 
heme-regiiated Inhibitor, and PERK/PEK, are activated by 
flavonokis, with phosphorylation of elF2a and Inhibitton of 
protein synthesis (111). 

Targeting elF4A and elF4E: AnUsense RNA 
and Peptides 

Antisenso expression of elF4A decreases the proliferation rate 
of melanoma colls (112). Sequestratton of eIRE by overexpres- 
ston of 4E-BP1 Is proapoptotic and decreases tumorigentelty 
(1 13, 114). Reductton of elF4E with antisense RMA decreases 
soft agar growtti, Increases tumor latency, and increases the 
rates of tumor doubling times (7). Antisense elF4E RNA treat- 



Toward the Future 

Trandatkm b a cnidal process In every cell. However, several 
dlterBlk)ns In tran^onal control occur In ca^^ 

appear to need an aberrantly activated translatkmal state fbr 
swviwal. thus allowing the 1a^ 

suiprteln^ kyw t0>dcity. Components of the translaitonal ma^ 
chlwy, sw^ as elF4E, and signal transtfeictk>n pathways irv 
volved In translatton Irttlalion, such mTOR, represent pronfeing 

for cancer therapy Jnhlbftors off me mTOR have already 
snovwi sonrio preliminary acli^% in dlnk»l trials 
that with the development of tjetter predictive maricers and 

better paflent selectton, response rales to single-agent therapy 
can be Improved. SmBar to other cytostaHcagerrts. however. 
mTOR Inhibitors are most likely to achlefva cOnteal utiity Iri 
comblna(lk)n therapy. In the Interim, our Ina^aslng understrnd- 
Ing of tianstetlon Initiation and signal transductton pathway 

promise to lead to the WenlHkatlon of new therapeutte targets 
In the near futurei 
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We have determined the relationship between mRNA and protein expression levels for selected genes 
expressed in the yeast Saccharomyces cerevisiae growing at mid-log phase. The proteins contained in total yeast 
cell lysate were separated by high-resolution two-dimensional (2D) gel electrophoresis. Over 150 protein spots 
were excised and identified by capillary liquid chromatography-tandem mass spectrometry (LC-MS/MS). 
Protein spots were quantified by metabolic labeling and scintillation counting. Corresponding mRNA levels 
were calculated from serial analysis of gene expression (SAGE) frequency tables (V. E. Velculescu, L. Zhang, 
W. Zhou, J. Vogelstein, M. A. Basrai, D. E. Bassett, Jr., P. Hieter, B. Vogelstein, and K. W. Kinzler, Cell 
88:243-251, 1997). We found that the correlation between mRNA and protein levels was Insufficient to predict 
protein expression levels from quantitative mRNA data. Indeed, for some genes, while the mRNA levels were 
of the same value the protein levels varied by more than 20-fold. Conversely, invariant steady-state levels of 
certain proteins were observed with respective mRNA transcript levels that varied by as much as 30-fold. 
Another interesting observation is that codon bias is not a predictor of either protein or mRNA levels. Our 
results clearly delineate the technical boundaries of current approaches for quantitative analysis of protein 
expression and reveal that simple deduction from mRNA transcript analysis is insufficient. 



« 

The description of the state of a biological system by the 
quantitative measurement of the system constituents is an es- 
sential but largely unexplored area of biology. With recent 
technical advances including the development of differential 
dispIay-PCR (21), of cDNA microarray and DNA chip tech- 
nology (20, 27), and of serial analysis of gene expression 
(SAGE) (34, 35), it is now feasible to establish global and 
quantitative mRNA expression profiles of cells and tissues in 
species for which the sequence of all the genes is known. 
However, there is emerging evidence which suggests that 
mRNA expression patterns are necessary but are by them- 
selves insufficient for the quantitative description of biological 
systems. This evidence includes discoveries of posttranscrip- 
tional mechanisms controlling the protein translation rate (15), 
the halMives of specific proteins or mRNAs (33), and the 
intracellular location and molecular association of the protein 
products of expressed genes (32). 

Proteome analysis, defined as the analysis of the protein 
complement expressed by a genome (26), has been suggested 
as an approach to the quantitative description of the state of a 
biological system by the quantitative analysis of protein expres- 
sion profiles (36). Proteome analysis is conceptually attractive 
because of its potential to determine properties of biological 
systems that are not apparent by DNA or mRNA sequence 
analysis alone. Such properties include the quantity of protein 
expression, the subcellular location, the state of modification, 
and the association with ligands, as well as the rate of change 
with time of such properties. In contrast to the genomes of a 
number of microorganisms (for a review, sec reference II) and 
the transcriptome of Saccharomyces cerevisiae (35). which have 
been entirely determined, no proteome map has been com- 
pleted to date. 

The most common implementation of proteome analysis is 
the combination of two-dimen.sional gel electrophoresis ('2DE) 
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(isoelectric focusing-sodium dodecyl sulfate [SDS]-polyacryl- 
amide gel electrophoresis) for the separation and quantitation 
of proteins with analytical methods for their identification. 
2DE permits the separation, visualization, and quantitation of 
thousands of proteins reproducibly on a single gel (18, 24). By 
itself, 2DE is strictly a descriptive technique. The combination 
of 2DE with protein analytical techniques has added the pos- 
sibility of establishing the identities of separated proteins (1, 2) 
and thus, in combination with quantitative mRNA analysis, of 
correlating quantitative protein and mRNA expression mea- 
surements of selected genes. 

The recent introduction of mass spectrometric protein anal- 
ysis techniques has dramatically enhanced the throughput and 
sensitivity of protein identification to a level which now permits 
the large-scale analysis of proteins separated by 2DE. The 
techniques have reached a level of sensitivity that permits the 
identification of essentially any protein that is detectable in the 
gels by conventional protein staining (9, 29). Current protein 
analytical technology is based on the ma.ss spectrometric gen- 
eration of peptide fragment patterns that are idiotypic for the 
sequence of a protein. Protein identity is established by corre- 
lating such fragment patterns with sequence databases (10, 22, 
37). Sophisticated computer software (8) has automated the 
entire process such that proteins are routinely identified with 
no human interpretation of peptide fragment patterns. 

In this study, we have analyzed the mRNA and protein levels 
of a group of genes expressed in exponentially growing cells of 
the yeast S. cerevisiae. Protein expression levels were quantified 
by metabolic labeling of the yeast proteins to a steady state, 
followed by 2DE and liquid scintillation counting of the se- 
lected, separated protein species. Separated proteins were 
identified by in-gel tryptic digestion of spots with subsequent 
analysis by microspray liquid chromatography- tandem ma.ss 
spectrometry (LC-MS/MS) and sequence database searching. 
The corresponding mRNA transcript levels were calculated 
from SAGE frequency tables (35). 

This study, for the first time, explores a quantitative com- 
parison of mRNA transcript and protein expression levels for 
a relatively large number of genes expressed in the same met- 
abolic state. The resultant correlation is insufficient for prcdic- 



1 720 



Vol. 19, 1999 



CORRELATION BETWEEN PROTEIN AND mRNA LEVELS IN YEAST 1721 










1 








Peptides 









— 1 — 1 — 1 11 
12 


1 1 1 I.I 1 — 1 — 1 — r 

14 16 
HmcCinin) 



Separate peptides 





^ WilTiiiiT r iiTrri m i rii'ti 1 1 1 miAtimiimm' 



Quadrupole 1 



Quadrupole 2 
Collision Cell 




Quadrupole 3 Tandem mass spectrum 





Theoretical 



Computer sequence 
database searching 



Peptide identification, 
Protein identification 




200 400 600 800 1000 1200 



Acquired 



FIG. 1 . Schematic illustration of protcomc analysis by 2DE and mass spcclromctry. In part I. proteins arc separated by 2DE, stained spots are excised and subjected 
to in-gcl digestion with trypsin, and the resulting peptides are separated by on-line capillary high-performarice liquid chromatography. In part II, a peptide is shown 
eluting from the column in pan 1. The peptide is ionized by electrospray ionization and enters the mass spectrometer. The mass of the ionized peptide is detected, and 
the first quadrupole mass fiher allows only the specific mass-io-charge ratio of the selected peptide ion to pass into the collision cell. In the collision cell, the energized, 
ionized peptides collide with neutral argon gas molecules. Fragmentation of the peptide is essentially random but occurs mainly at the peptide bonds, resulting in smaller 
peptides of differing lengths (masses). These peptide fragments are detected as a tandem mass (MS/MS) spectrum in the third quadrupole ma.ss filter where two ion 
series are recorded simultaneously, one each from sequencing inward from the N and C termini of the peptide, respectively. In part III, the MS/MS spectrum from the 
selected, ionized peptide is compared to predicted landcm mass spectra computer generated from a sequence database. Provided that the peptide sequence exists in 
the database, the peptide and, by as-sociation, the protein from which the peptide was derived can be identified. Unambiguous protein identification is attained in a single 
analysis because multiple peptides are identified as being derived from the same protein. 



lion of protein levels from mRNA transcript levels. We have 
also compared the relative amounts of protein and mRNA 
with the respective codon bias values for the corresponding 
genes. This comparison indicates that codon bias by itself is 
insufficient to accurately predict either the mRNA or the pro- 
tein expression levels of a gene. In addition, the results dem- 
onstrate that only highly expressed proteins are detectable by 
2DE separation of total cell lysates and that theiefore the 
construction of complete proteome maps with current technol- 
ogy will be very challenging, irrespective of the type of organ- 
ism. 

MATERIALS AND METHODS 

Yeast strain and gmwih c<inditions. The source of protein and message iran- 
scrrpls for all experiments was YPI149'; (M/I7a tmi3 52 lys2-SOI uJe2-WI 
Iiu2-^i his3'i^200 (30). Logarithmically jirowlng were dbtaincd by 

growing yeast cells to early log phase O x 1U''ccil.s/ml) in YPI) rich medium 
(YPD supplemented with 6 mM uracil, 4.8 mM adenine, and 24 mM tr\ptophan| 
ai MTV Metabolic labclinjt of protein was accomplished in YPI) medium 



exactly as described elsewhere (4) with the exception that 1 ml of cells was 
labeled with 3 mCi lu offset methionine present in YPD medium. Protein was 
harvested as described by Garrcls and coworkers (12). Harvested protein was 
lyophilized, resu.spended in isoelectric focusing gel rehydration solution, and 
stored at -WC. 

2DE. Soluble proteins were run in the first dimension by using a commercial 
tlatbed electrophoresis system (Multiphor II; Pharmacia Biotech). Immobilized 
polyacrylamidc gel (IPG) dry strips with nonlinear pH .^.0 lo 10.0 gradients 
(Amcrsham-Pharmacia Biotech) were used for the first -dimension separation. 
Forty micrograms of protein from wholc-ccll lysates was mixed with IPG strip 
rehydration buffer (8 M urea, 2% Nonidet P-4U, 10 mM dithiothrcitol), and 250 
to ^HO ^ll of solution w:<s added to individual lanes of an IPG strip rehydration 
tray (Amcrsham-Pharmacia Biotech). The strips were allowed lo rchydrale at 
room temperature for 1 h. The samples were run at 300 V-IO mA-5 W for 2 h, 
then ramped to 3,.StX» V-IO mA-5 W twcr a period of 3 h. and then kept at 3..*»00 
V-IO mA-5 W for 15 lo 19 h. At the end of the first-dimension run (60 to 70 kV • 
h), the IPC strips were rccquilibraied for 8 min in 27*. (wiAfol) dilhiothrciiol in 
29/ (wtA'ol) SDS-() M urea-30'>;.- {w\No\) glyccrol-0.05 M Tris MCI (pll 6.8) and 
ftir 4 min in 2.$% iodoacetaniide in 2% (wiMil) SDS-6 M urea-30% (wtAol) 
glyceroMUlS M Tris MCI (pH 6.8). Following rccquilibraliim. the strips were 
transferred and apposed to U}^- polyacr\'lamidc second-dimcnsiim gols. Poly- 
acrylamide gels were |X)ured in a casting stand with ID^'r acrylamidc-2.67^f: 
piperazine diacrylamide-f 1.375 M Trisbasc-Mtil (pM «.«)-<). I % |wi/Col) SDS-<).n5% 
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FIG. 2. 2D silver-slaincd gel of ihe proleins in ycasl lotal celJ lysalc. Proteins were separated in the first dimension (horizonial) by isoelectric focusing and then in 
the second dimension (vertical) by molecular weight sieving. Protein spots (156) were chosen to include the entire range of molecular weights, isoelectric focusing points, 
and staining intensities. Spots were excised, and the corresponding protein was identified by mass spectrometry and database searching. The spots arc labeled on the 
gel and correspond to the data presented in Tabic 1. Molecular weights are given in thousands. 



(wtyvol) ammonium persulf:iic-0.05% TEMED (iVA'A'A'-tetrameihylethyl- 
enediamine) in Milli-Q water. The apparatus used to run second-dimension gels 
was a noncommercial apparatus from Oxford Glycoscicnces, Inc. Once the IPG 
strips were apposed to the second-dimension gels, they were immediately run at 
50 mA (constant)-50() V-85 W for 20 min, followed by 200 mA (const ant)-500 
V-85 W until the buffer front line was 10 to 15 mm from the bottom of the gel. 
Gels were removed and .silver stained according to the procedure of Shcvchcnko 
ct a I. (29). 

Protein idenlificalion. Gels were exposed tt) X-ray film overnight, and then the 
silver staining and film were used to excise 156 spots of varying intensities, 
molecular weights, and isoelectric focusing points. In order to increase the 
detection limit by mass spectrometry, spt»ls were cut out and pix^lcd from up to 
four identical cold, sllvcr-siaincd gels. In-gel tryptic digests of pooled spots were 
performed as described previously (29). Tr>ptic peptides were analyzed by mi- 
ciocapillary LC-MS with automated switching lo MS/MS mode lor peptide 
fragmentation. Spectra were .searched against the composite OWL protein se- 
quence database (version 30.2; 250.514 protein sequences) (24a) by using the 
computer program Sequcsl (S). which matches theoretical arid acquired tandem 
mu.ss spectra. A protein match was determined by comparing the number of 
pcpiidcs idcntiricd ;ind their respective cross-correlation scores. All protein 
idcntilicahons were vcritkd by comparison with theoretical molecular weights 
and isoelectiic points. 



mRNA quantitation. Velculcscu and coworkers have previously generated 
frequency tables tor yeast mRNA transcripts from the same strain grown under 
the same slated conditions as described herein (35). The SAGE technology is 
based on two main principles. First, a short sequence tag (15 bp) that contains 
suiricient information uniquely to identify a transcript is generated. A single tag 
is usually generated from each mRNA transcript in the cell which corresponds to 
15 bp at the 3'-most cutting site for Mi 111. Second, many transcript tags can be 
concatenated into a single molecule and then sequenced, revealing the identity of 
multiple taps simultaneously. Over 20,000 transcripts were sequenced from yeast 
strain YPH499 growing at mid-log phase on glucose. Assuming the previously 
derived estimate of 15,000 mRNA molecules per cell (16), this would represent 
a 1.3-fi)ld coverage even for mRNA molecules present at a single copy per cell 
and would provide a probability <»f detecting such transcripts. (Tompuler 
software which ii>ok ft»r input ihe gene detected, examirred the nucleotide se- 
quence, and performed the calculation a.«. described by Velculcscu and ctiworkers 
(.^5) w;is wriuen. In practice, we found that for 21 of 128 (16%) genes examined 
viable mRNA levels from SAGH data could not be calculated. This was because 
(i) no CATG site was (oum\ in the open reading frame (ORF). (ti) a CATG site 
was found but the corresponding lO-bp putative SAGE tag was not found in the 
frequency tables, or (iii) identical putative SAGK tags were present for muhiplc 
genes (e.g.. TI.>H2_YEAS 1 and IDH.^^YKAS T). 
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TABLE 1. Expressed genes identified from 2D gel in Fig. 2 
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TABLE 1 — Continued 
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6.47 


18 


ACHl 


5.4 


1.5 


0.327 


61,353 


5.87 


21 


PDCl 


6.5 


200.7 


0.962 


61,353 


5.87 ■ 


22 


POCl 


303.2 


200.7 


0.962 


b 1 ,353 


5.87 


23 


POCl 


16.3 


200.7 


0.962 


61,649 


5.54 


38 


ccrs 


7 -> 


L5 


0.271 



Coniiiuted 



Continued on JoUowin^ puiie 
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TABLE 1 — Continued 



Mol wt 




Spot no. 


YPD Bcnc 
name' 


I Foicin 

9.bunHn ncc 

(10^ copies/ 
cell) 


mRNA 
abundance 
(copies/celi) 


bias 


61,902 


6.21 


101 


PDC5 


4.3 


NA*^ 


0.828 


62,266 


6.19 


16 


ICLl 


20.1 


NA*^ 


0.327 


62,862 


8.02 


19 


ILV3 


5.3 


4.5 


0.548 


63,082 


6.40 


119 


PGM2 


Z2 


3.0 


0.402 


64,335 


5.77 


5 


PABl 


30.4 


1.5 


0.616 


66,120 


5.42 


8 


STIl 


6.7 


0.7 


0.313 


66,120 


5.42 


9 


STIl . 


6.4 


0.7 


0.313 


66.450 


5.29 


141 


SSB2 


7.0 


NA^ 


0.880 


66,450 


5.29 


142 


SSB2 


2.3 


• NA^ 


0.880 


66,456 


5.23 


10 


SSBl 


64.5 


79.5 


0.907 


66,456 


5.23 


11 


SSBl 


59.0 


79.5 


0.907 


66,456 


5.23 


12 


SSBl 


13.7 


79.5 


0.907 


68 397 


5 82 


82 


LEU4 


3 1 


3 0 


0 407 


69,313 


4.90 


13 


SSA2 


24.3 


18.6 


0.892 


69,313 


4.90 


14 


SSA2 


77.1 


18.6 


0.892 


74,378 


8.46 


15 


YKL029C 


2.8 


3.7 


0353 


75,396 


5.82 


6 . 


GRSl 


5.5 


7.4 


0.500 


85,720 


6.25 


1 


MET6 


2.0 


NA"^ 


0.772 


85,720 


6.25 


2 


MET6 


10.9 


NA*^ 


0.772 


85,720 


6.25 


3 


MET6 


1.4 


NA*^ 


0.772 


93,276 


6.11 


131 


EFn 


17.9 


41.6 


0.890 


93,276 


6.11 


132 


EFIl 


5.7 


41.6 


0.890 


102,064*^ 


6.6r 


94 


ADE3 


4.8 


5.2 


0.423 


107,482^ 




4 


MCM3 


. 2.7 


NA'^ 


0.240 



* YPD gene names arc available from the YPD website (39), 
^ NA, calculation could not be performed or was not available. 
''mRNA data inconclusive or NA, 

^ No methionines in predicted ORF: therefore, protein concentration was not 
determined. 

' Measured molecular weight or pi did not match theoretical molecular weight 
or pL 



Protein quantitation. (-^^Slmeihioninc-labcled gels were exposed to X-ray film 
overnight, and then the silver stain and film v^cre used to excise 156 spots of 
varying intensities, molecular weights, and pis. The excised spots were placed in 
0.6-ml microcentrifuge tubes, and scintillation cocktail (100 jxl) was added. The 
samples were vortcxed and counted. In addition, two parallel gels were electro- 
blotted to polyvinylidenc difluoride membranes. The membranes were exposed 
to X-ray film, and four intense single spots were excised from each membrane 
and subjected to amino acid analysis. For these four spots, a mean of 209 ± 4 
cpm/pmol of protein/mcthiuninc was found. This number was used to quantitatc 
all remaining spots in conjunction with the number of methionines present in the 
protein. 

To ensure that proteins were labeled to equilibrium, parallel 2D gels were 
prepared and run on yeast metabolically labeled for K 2, 6, or IS h. The 
corres{>onding 156 spots were excised frmn each gel, and radioactivity was mea- 
sured by liquid scintillation counting for each spot. Calculated protein levels were 
highly reproducible for all time points measured after 1 h. 

Calculation of codon bias and predicted half-life. Codon bias values were 
extracted from the YPD spreadsheet (17). Protein half-lives were calculated 
based on the N-cnd rule (33). When the .\-terminaI processing was not known 
experimentally, it was predicted based on the aflinily of methionine aminopcp- 
tidasc(31). 

RESULTS 

Characteristics of proteonie approach. Nearly every facet of 
proteome analysis hinges on the unambiguous identification of 
large numbers of expressed proteins in cells. Several tech- 
niques have been described previously for the identification of 
proteins separated by 2DE, including N-terminal and internal 
sequencing (j. 2), amino acid analysis (38), and more recently 
mass .spectrometry (25). We utilized techniques based on mass 
spectrometry because they afford the highest levels of sen.sitiv- 
ity and provide unambiguous identilication. The specific pro- 
cedure used is schematically illustrated in Fig. 1 and is based 
on three principles. First, proteins are removed from the gel by 



proteolytic in-gel digestion, and the resulting peptides are sep- 
arated by on-line capillary high-performance liquid chromatog- 
raphy. Second, the elating peptides are ionized and detected, and 
the specific peptide ions are selected and fragmented by the 
mass spectrometer. To achieve this, the mass spectrometer 
switches between the MS mode (for peptide mass identifica- 
tion) and the MS/MS mode (for peptide characterization and 
sequencing). Selected peptides are fragmented by a process 
called collision-induced dissociation (CID) to generate a tan- 
dem mass spectrum (MS/MS spectrum) that contains the pep- 
tide sequence information. Third, individual CID mass spectra 
are then compared by computer algorithms to predicted spec- 
tra from a sequence database. This results in the identification 
of the peptide and, by association, the protein(s) in the spot. 
Unambiguous protein identification is attained in a single anal- 
ysis by the detection of multiple peptides derived from the 
same protein. 

Protein identification. Yeast total cell protein lysate (40 (ig), 
metabolically labeled with [^^S]methionine, was electro- 
phoretically separated by isoelectric focusing in the first dimen- 
sion and by SDS~10% polyacrylamide gel electrophoresis in 
the second dimension. Proteins were visualized by silver stain- 
ing and by autoradiography. Of the more than 1,000 proteins 
visible by silver staining, 156 spots were excised from the gel 
and subjected to in-gel tryptic digestion, and the resulting 
peptides were analyzed and identified by microspray LC- 
MS/MS techniques as described above. Tlie proteins in this 
study were all identified automatically by computer software 
with no human interpretation of mass spectra. They are indi- 
cated in Fig. 2 and detailed in Table 1. 

The CID spectra shown in Fig. 3 indicate that the quality of 
the identification data generated was suitable for unambiguous 
protein identification. The spectra represent the amino acid 
sequences of tryptic peptides NSGDIVNLGSIAGR (Fig. 3A) 
and FAVGAFTDSLR (Fig. 3B). Both peptides were derived 
from protein S57593 (hypothetical protein YMR226C), which 
migrated to spot 114 (molecular weight, 29,156; pl, 6.59) in the 
2D gel in Fig. 2. Five other peptides from the same analysis 
were also computer matched to the same protein sequence. 

Protein and mRNA quantitation. For the 156 genes investi- 
gated, the protein expression levels ranged from 2,200 (PGM2) 
to 863,000 (TDH2yTDH3) copies/cell. The levels of mRNA for 
each of the genes identified were calculated from SAGE fre- 
quency tables (35). These tables contain the mRNA levels for 
4,665 genes in yeast strain YPH499 grown to mid-log phase in 
YPD medium on glucose as a carbon source. In some in- 
stances, the mRNA levels could not be calculated for reasons 
stated in Materials and Methods. For the proteins analyzed in 
this study, mean transcript levels varied from 0.7 to 473 copies/ 
cell. 

Selection of the sample population for mRNA-protein ex- 
pression level correlation. The protein spots selected for iden- 
tification were selected from spots visible by silver staining in 
the 2D gel. An attempt was made not to include spots where 
overlap with other .spots was readily apparent. The number of 
proteins identified was 156 (Table 1). Some proteins migrated 
to more than one spot (presumably due to differential protein 
processing or modifications)^ and protein levels from these 
spots were calculated by integrating the intensities of the dif- 
ferent spots. The 156 protein spots analyzed represented the 
products of 128 diflerent gene.s. Genes were excluded from the 
correlation analysis only if part of the data set was missing; i.e.. 
genes were excluded if (i) no mRNA expression data were 
available for the protein or putative SAGE lags were ambig- 
uous, (ii) the amino acid sequence did not contain methionine, 
(iii) more than a single protein was conclusively idcntihed as 
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FIG. 3. Tandem mass (MS/MS) spectra resulting from analy.sis of a single spot on a 2D gel. The first quadrupolc sclccicd a single mass-io-chargc ratio (m/z) of 687.2 
(A) or 592.6 (B), while the collision cell was filled with argon gas, and a voltage which caused the peptide to undergo fragmentation by CID was applied. The third 
quadrupolc scanned the mass range from 50 to 1,400 m/z. The computer program Sequcst (8) was utilized to match MS/MS spectra lo amino acid sequence by database 
searching. Both spectra matched peptides from the same protein, S57593 (yeast hypothetical protein YMR226C). Five other peptides from the same analysis were 
matched to the same protein. 



migrating to the same gel spot, or (iv) the theoretical and 
observed pis and molecular weights could not be reconciled. 
Alter these criteria were applied, the number of genes used in 
the correlation analysis was 106. 



Codon bias and predicted half-lives. Codon bias is thought 
to be an indicator of protein expression, with highly expressed 
proteins having large codon bias values. The codon bias di.stri- 
bution for the entire set of more than 6.(KX) predicted yeast 
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gene ORFs is presented in Fig. 4A. The interval with the 
largest frequency of genes is between the codon bias values of 
0.0 and 0.1. This segment contains more than 2^00 genes. The 
distribution of the codon bias values of the 128 different genes 
found in this study (all protein spots from Fig, 2) is shown in 
Fig. 4B, and protein half-h'ves (predicted from applying the 
N-end rule [33] to the experimentally determined or predicted 
protein N termini) are shown in Fig. 4C No genes were iden- 
tified with codon bias values less than 0.1 even though thou- 
sands of genes exist in this category. In addition, nearly all of 
the proteins identified had long predicted half-lives (greater 
than 30 h). 

Correlation of mRNA and protein expression levels. The 
correlation between mRNA and protein levels of the genes 
selected as described above is shown in Fig. 5. For the entire 
group (106 genes) for which a complete data set was gener- 
ated, there was a general trend of increased protein levels 
resulting from increased mRNA levels. The Pearson product 
moment correlation coefficient for the whole data set (106 
genes) was 0.935. This number is highly biased by a small 
number of genes with very large protein and message levels. A 
more representative subset of the data is shown in the inset of 
Fig. 5. It shows genes for which the message level was below 10 
copies/cell and includes 69% (73 of 106 genes) of the data used 
in the study. The Pearson product moment cori*elation coeffi- 
cient for this data set was only 0.356. We also found that levels 
of protein expression coded for by mRNA with comparable 
abundance varied by as much as 30-fold and that the mRNA 
levels coding for proteins with comparable expression levels 
varied by as much as 20-fold. 

The distortion of the correlation value induced by the un- 
even distribution of the data points along the x axis is further 
demonstrated by the analysis in Fig. 6. The 106 samples in- 
cluded in the study were ranked by protein abundance, and the 
Pearson product moment correlation coefficient was repeat- 
edly calculated after including progressively more, and higher- 
abundance, proteins in each calculation. The correlation values 
remained relatively stable in the range of 0.1 to 0.4 if the 
lowest -expressed 40 to 95 proteins used in this study were 
included. However, the correlation value steadily climbed by 
the inclusion bf each of the 11 very highly expressed proteins. 

Correlation of protein and mRNA expression levels with 
codon bias. Codon bias is the propensity for a gene to utilize 
the same codon to encode an amino acid even though other 
codons would insert the identical amino acid in the growing 
polypeptide sequence. It is further thought that highly ex- 
pressed proteins have large codon biases (3). To assess the 
value of codon bias for predicting mRNA and protein levels in 
exponentially growing yeast cells, we plotted the two experi- 
mental sets of data versus the codon bias (Fig. 7). The distri- 
bution patterns for both mRNA and protein levels with respect 
to codon bias were highly similar. There was high variability in 
the data within the codon bias range of 0.8 to 1.0. Although a 
large codon bias generally resulted in higher protein and mes- 
sage expression levels, codon bias did not appear to be predic- 
tive of cither protein levels or mRNA levels in the cell. 

DISCUSSION 

The desired end point for the description of a biological 
system is not the analysis of mRNA transcript levels alone but 
also the accurate measurement of protein expression levels and 
their respective activities. Quantitative analysis of global 
mRNA levels currently is a preferred methi)d for the analysis 
of the state of cells and tissues (II). Several methods which 
either provide absolute mRNA abundance (34, 35) or relative 
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mRNA levels in comparative analyses (20, 27) have been de- 
scribed elsewhere. Tlie techniques are fast and exquisitely sen- 
sitive and can provide mRNA abundance for potentially any 
expressed gene. Measured mRNA levels are often implicitly or 
explicitly extrapolated to indicate the levels of activity of the 
corresponding protein in the cell. Quantitative analysis of pro- 
tein expression levels (proieome analysis) is much more time- 
consuming becau.se proteins are analyzed .sequentially one by 
one and is not general becau.se analyses are limited to the 
relatively highly expressed proteins. Proteome analysis does, 
however, provide types of data that are of critical importance 
for the description of the slate of a biological system and that 
are not readily apparent from the sequence and the level of 
expression of the mRNA transcript. This study attempts to 
examine the relationship between mRNA and protein expres- 
sion levels for a large number of expressed genes in cells 
representing the same state. 

Limils in the sensilivily of current protein analysis technol- 
ogy precluded a completely random sampling of yeast proteins. 
We therefore based the study on those proteins visible by silver 
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FIG. 5. Correlation between protein and mRNA levels for 106 genes in yeast growing at log phase with glucose as a carbon source. mRNA and protein levels were 
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69% of the original data set. The Pearson product moment correlation for the entire data set was 0.935. The correlation for the insei containing 73 proteins (69%) was 
only 0.356. 



Staining on a 2D gel. Ot the more than 1,000 visible spots, 156 
were chosen to include the entire range of molecular weights, 
isoelectric focusing points, and staining intensities displayed on 
the 2D protein pattern. The genes identified in this study 
shared a number of properties. First, all of the proteins in this 
study had a codon bias of greater than O.J and 93% were 
greater than 0.2 (Fig. 4B). Second, with few exceptions, the 
proteins in this study had long predicted half-lives according to 
the N-end rule (Fig. 4C). Third, low-abundance proteins with 
regulatory functions such as transcription factors or protein 
kinases were not identified. 

Because the population of proteins used in this study ap- 
pears to be fairly homogeneous with respect to predicted half- 
life and codon bia.s, it might be expected that the correlation of 
the mRNA and protein expression levels would be .stronger for 
this population than for a random sample of yeast proteins. We 
tested this assumption by evaluating the correlation value if 
different subsets of the available data were included in the 
calculation. The 106 proteins were ranked from lowest to high- 
est protein expression level, and the trend in the correlation 
value was evaluated by pi ogressivcly including more of the 
higher-abundance proteins in the calculation (Fig. 6). The cor- 
relaiion value when only the lower-abundance 40 to 93 pro- 
teins were examined was consi.stently between O.l and 0.4. If 
the ] J most abundant proteins were included, the correlation 
steadily increased to 0.94. We therefore expect that the corre- 
lation for all yeasi proleins or for a random selection would be 
less than 0.4, The observed level of correlation between 
mRNA and protein cxpiession levels suggests the importance 



of posttranslational mechanisms controlling gene expression. 
Such mechanisms include trahslational control (15) and con- 
trol of protein half-life (33). Since these mechanisms are also 
active in higher eukaryotic cells, we speculate that there is no 
predictive correlation between steady-state levels of mRNA 
and those of protein in mammalian cells. 

Like other large-scale analyses, the present study has several 
potential sources of error related to the methods used to de- 
termine mRNA and protein expression levels. The mRNA 
levels were calculated from frequenc7 tables of SAGE data. 
This method is highly quantitative because it is based on actual 
sequencing of unique tags from each gene, and the number of 
times that a tag is represented is proportional to the number of 
mRNA molecules for a specific gene. This method has some 
limitations including the following: (i) the magnitude of the 
error in the measurement of mRNA levels is inversely propor- 
tional to the mRNA levels, (ii) SAGE tags from highly similar 
genes may not be distinguished and therefore are summed, (iii) 
some SAGE tags arc from sequences in the 3' untranslated 
region of the transcript, (iv) incomplete cleavage at the SAGE 
tag site by the restriction enzyme can result in two tags repre- 
senting one niRNA, and (v) some transcripts actually do not 
generate a SAGE tag (34, 35). 

For the SAGE method, the error associated with a value 
increases with a decreasing number of transcripts per cell. The 
conclusions drawn from this .study are dependent on the qual- 
ity of the mRNA levels from previi)usly published data (35). 
Since more than 65% of the mRNA levels included in this 
study were calculated to 10 copies/cell or less (40%- were less 
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. than 4 copies/cell), the error associated with these values may 
be quite large. The mRNA levels were calculated from more 
than 20,000 transcripts. Assuming that the estimate of 15,000 
mRNA molecules per cell is correct (16), this would mean that 
mRNA transcripts present at only a single copy per cell would 
be detected 72% of the time (35). The mRNA levels for each 
gene were carefully scrutinized, and only mRNA levels for 
which a high degree of confidence existed were included in the 
correlation value. 

Protein abundance was determined by metabolic radiolabel- 
ing with [ Sjmethionine, The calculation required knowledge 
of three variables: the number of methionines in the mature 
protein, the radioactivity contained in the protein, and the 
specific activity of the radiolabel normalized per methionine. 
The number of methionines per protein was determined from 
the amino acid sequence of the proteins identified by tandem 
mass spectrometry. For some proteins, it was not known 
whether the methionine of the nascent polypeptide was pro- 
cessed away. The N termini of those proteins were predicted 
based on the spccificit>' of methionine aminopcptidase (31), If 
the N-tcrminal piocessing did not conform to the predicted 
specificit)' of processing enz>'mes, the calculation of the num- 
ber of methionines would be alfected. This discrepancy would 
affect most the quantitation of a protein with a ver>' low num- 
ber of methionines. The average number of calculated methi- 
onines per protein in this study was 7.2. We therefore expect 
the poienlial for erroneous protein quantitation due to un- 
usual N-tcrminal processing to be small. 



The amount of radioactivity contained in a single spot might 
be the sum of the radioactivity of comigrating proteins. Be- 
cause protein identification was based on tandem mass spec- 
trometric techniques, comigrating proteins could be identified. 
However, comigrating proteins were rarely detected in this 
study, most likely because relatively small amounts of total 
protein (40 jxg) were initially loaded onto the gels, which re- 
sulted in highly focused spots containing generally 1 to 25 ng of 
protein. Because of the relatively small amount loaded, the 
concentrations of any potentially comigrating protein would 
likely be below the limit of detection of the mass spectrometry 
technique used in this study (1 to 5 ng) and below the limit of 
visualization by silver staining (1 to 5 ng). In the overwhelming 
majority of the samples analyzed, numerous peptides from a 
single protein were detected. It is assumed that any comigrat- 
ing proteins were at levels too low to be detected and that their 
influence in the calculation would be small. 

The specific activit>' of the radiolabel was determined by 
relating the precise amount of protein present in selected spots 
of a parallel gel, as determined by quantitative amino acid 
composition analysis, to the number of methionines present in 
the sequence of those proteins and the radioactivity deter- 
mined by liquid scintillation counting. It is possible thai the 
resulting number might be influenced by unavoidable losses 
inherent in the amino acid analysis procedure applied. Because 
four diftcreni proteins were utilized in the calculation and the 
experiment wijs done in duplicate, the specific activity calcu- 
lated is thought to be highly accurate. Indeed, the specific 
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activities calculated for each of the four proteins varied by less 
than 10%. Any inconsistencies in the calculation of the specific 
activity would result in differences in the absolute levels calcu- 
lated but not in the relative numbers and would therefore not 
influence the correlation value determined. 

The protein quantitative method used eliminates a number 
of potential errors inherent in previous methods for the quan- 
titation of proteins separated by 2DE, such as preferential 
protein staining and bias caused by inequalities in the number 
of radiolabeled residues per protein. Any 2D gel-based method 
of quantitation is complicated by the fact that in some cases the 
translation products of the same mRNA migrated to different 
spots. One major reason is posttranslational modification or 
processing of the protein. Also, artifactual proteolysis during 
cell lysis and sample preparation can lead to multiple resolved 
forms of the protein. In such cases, the protein levels of spots 
coded for by the .same mRNA were pooled. In addition, the 
existence of other spots coded for by the same mRNA that 
were not analyzed by mass speclrometr>' or that were below the 
limit of detection for silver staining cannot be ruled out. How- 
ever, since this study is based on a class of highly expressed 
proteins, the presence of undetected minor spots below silver 
staining sensitivity corresponding to a protein anal>'zed in the 
study would generally cause a relatively small error in protein 
qunniitaiion. 

Codon bias is a measure of the propensity of an organism to 
selectively utilize certain codons which result in the incorpo- 
ration of the same amino acid residue in a growing polypeptide 
chain. There are 61 po.ssible codons that code for 20 amino 
acids. The larger the codon bias value, the smaller the number 
of codons that arc u.scd to encode the protein (19). It is 



thought that codon bias is a measure of protein abundance 
because highly expressed proteins generally have large codon 
bias values (3, 13). 

Nearly all of the most highly expressed proteins had codon 
bias values of greater than 0.8. However, we detected a number 
of genes with high codon bias and relative low protein abun- 
dance (Fig. 7). For example, the expressed gene with both the 
second largest protein and mRNA levels in the study was 
EN02_YEAST (775,000 and 289.1 copies/cell, respectively). 
EN01_YEAST was also present in the gel at much lower 
protein and mRNA levels (44,200 and 0.7 copies/cell, respec- 
tively). The codon bias values for EN02 and ENOl are similar 
(0.96 and 0.93, respectively), but the expression of the two 
genes is differentially regulated. Specifically, EN01_YEAST is 
glucose repressed (6) and was therefore present in low abun- 
dance under the conditions used. Other genes with large codon 
bias values that were not of high protein abundance in the gel 
include EFTL TIFl, HXK2, GSPl, EGD2, SHM2, and TALI. 
We conclude that merely determining the codon bias of a gene 
is not sufficient to predict its protein expression level. 

Interestingly, codon bias appears to be an excellent indicator 
of the boundaries of current 2D gel protcome analysis tech- 
nology. There are thousands of genes with expressed mRNA 
and likely expressed protein with codon bias values less than 
0.1 (Fig. 4A). In this study, we detected none of them, and only 
a very small percentage of the genes detected in this study had 
codon bias values between 0.1 and 0.2 (Fig. 4B). Indeed, in 
every examined yeast proteome study (5, 7, 13, 28) where the 
combined total number of identified proteins is 300 to 400, this 
same observation is true. It is' expected that for the more 
complex cells of higher cukaryolic organisms the detection of 
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low-abundance proteins would be even more challenging than 
for yeast. This indicates that highly abundant, long-lived pro- 
teins are overwhelmingly detected in proteome studies. If pro- 
teome analysis is to provide truly meaningful information 
about cellular processes, it must be able to penetrate to the 
level of regulatory proteins, including transcription factors and 
protein kinases. A promising approach is the use of narrow- 
range focusing gels with immobilized pH gradients (IPG) (23). 
This would allow for the loading of significantly more protein 
per pH unit covered and also provide increased resolution of 
proteins with similar electrophoretic mobilities. A standard pH 
gradient in an isoelectric focusing gel covers a 7-pH-unit range 
(pH 3 to 10) over 18 cm. A narrow-range focusing gel might 
expand the range to 0.5 pH units over 18 cm or more. This 
could potentially increase by more than 10-fold the number of 
proteins that can be detected. Clearly, current proteome tech- 
nology is incapable of analyzing low-abundance regulatory pro- 
teins without employing an enrichment method for relatively 
low-abundance proteins. In conclusion, this study examined 
the relationship between yeast protein and message levels and 
revealed that transcript levels provide little predictive value 
with respect to the extent of protein expression. 
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.'dHacte. If tliese minor cell proteins differ among cells to the same extent as the 
jjjore abiindant proteins, as is commonly assumed, only a small number of pro- 
^ differences (perhaps several hundred) suffice to create very large differences 
in'cdl morphology and behavior. 
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A Cell Can Change the Expression of Its Genes 
in Response to External Signals ^ 

Most of the specialized cells in a multicellular organism are capable of altering 
jjicir patterns of gene expression in response to extracellular cues. If a liver ceB 
isei9}0sed to a glucocorticoid hormone, for example, the production of several 
specific proteins is dramatically increased. Glucocorticoids are released during 
periods of starvation or intense exercise and signal the liver to increase the 
production of glucose from amino acids and other small molecules; the set of 
proteins whose production is induced includes enzymes such as tyrosine amino- 
tiansfaase, which helps to convert tyrosme to glucose. When the hormone is ho 
longer present, the production of these proteins drops to its normal level. 

Other cell types respond to glucocorticoids in different ways. In fat ceDs, for 
example, the production of tyrosine aminotransferase is reduced, while some 
other cell types do not respond to glucocorticoids at all. These examples illustrate 
a general feature of cell specialization--different cell types often respond hi dif- 
ferent ways to the same extracellular signal. Underlying this specialization are 
features that do not change, which give each cell type its permanently distmc- 
live character. These features reflect the persistent expression of different sets of 
genes. 



Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protein ^ 

If differences between the various ceU types of an organism depend on the par- 
ticular genes that the cells express, at what level is the control of gene expression 
exercised? There are many steps in the pathway leading from DNA to protein, and 
all of them can in principle be regulated. Thus a ceU can control the proteins it 
makes by (1) controlling when and how often a given gene is transcribed (tran- 
Hjiptional control), (2) controlling how the primary RNA transcript is spliced or 
otherwise processed (RNA processing control), (3) selecting which completed 
mRNAs m the cell nucleus are exported to the cytoplasm (RNA transport con- 
(4) selecting which mRNAs in the cytoplasm are translated by ribosomes 
jranslational control), (5) selectively destabilizing certain mRNA molecules in 
cytoplasm (mRNA degradation control), or (6) selectively activating, inacti- 
vating, or compartmentalizing specific protein molecules after they have been 
made (protein activity control) (Figure 9-2). 

h^^^ ^^^^ ^^"^^ transcriptional controls are paramount. This makes sense 
ecause, of aU the possible control points iUustrated in Figure 9-2, only transcrip- 
nai control ensures that no superfluous intermediates are synthesized. In die 
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Figure 9-2 Six steps at which 
eucaryote gene expression can be 
controlled. Only controls that operate 
at steps I through 5 are discussed in 
this chapter. The regulation of protein 
activity (step 6) is discussed in 
Chapter 5; this includes reversible 
activation or inactivation by protein 
phosphorylation as well as 
irreversible inactivation by proteolytic 
degradation. 
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following sections we discuss the DNA and protein components that regulate the 
initiation of gene transcription. We return at the end of the chapter to the other 
ways of regulating gene expression. 



Summary 

The genome of a cell contains in its DNA sequence the information to make many 
thousands of different protein andBNA molecules, A cell typically expresses only d 
fraction ofits genes, and the different types of cells in multicellular organisms arise 
because different sets of genes are expressed. Moreover, cells can change the pattern 
of genes tivey express in response to dumges in their environment, sudi as signals from 
other cells. Although all of the steps involved in expressing a gene can in principle be 
regulated, for most genes the initiation ofRNA transcription is the most important 
point of control 



DNA-binding Motifs in Gene 
Regulatory Proteins ^ 

How does a cell determine which of its thousands of genes to transcribe? As dis- 
cussed in Chapter 8, the transcription of each gene is controlled by a regulatory 
region of DNA near the site where transcription begins. Some regulatory regions 
are simple and act as switches that are tlirown by a single signal. Other regula- 
tory re^ons are complex and act as tiny microprocessors, responding to a vari- 
ety of signals that they interpret £ind integrate to switch the neighboring gene on 
or off. Whether complex or simple, these switching devices consist of two fun- 
damental types of components: (1) short stretches of DNA of defined sequence 
and (2) gene regulatory proteins that recognize and bind to them. 

We begin our discussion of gene regulatory proteins by describing how these 
protieins were discovered. 

Gene Regulatory Proteins Were Discovered Using 
Bacterial Genetics ^ 

Genetic analyses in bacteria carried out in the 1950s provided the first evidence 
of the existence of gene regulatory proteins that turn specific sets of genes on 
or off. One of these regulators, the lambda repressor, is encoded by a bacterial 
virus, bacteriophage lambda The repressor shuts off the viral genes that code for 
the protein components of new virus particles and thereby enables the viral ge- 
nome to remain a silent passenger in the bacterial chromosome, multiplying with 
the bacterium when conditions are favorable for bacterial growth (see Figure 
6-80). The lambda repressor was among the first gene regulatory proteins to be 
characterized, and it remains one of the best understood, as we discuss later. 
Other bacterial regulators respond to nutritional conditions by shutting off genes 
encoding specific sets of metabolic enzymes when they are not needed. The lac 
repressor, for example, the first of these bacterial proteins to be recognized, turns 
off the production of the proteins responsible for lactose metabolism when this 
sugar is absent from the medium. 

The first step toward understanding gene regulation was the isolation of 
mutant strains of bacteria and bacteriophage lambda that were unable to shut 
off specific sets of genes. It was proposed at the time, and later proved, that most 
of these mutants were deficient in proteins acting as specific repressors for these 
sets of genes. Because these proteins, like most gene regulatory proteins, are 
present in small quantities, it was difficult and time-consuming to isolate them. 
They were eventually purified by fractionating cell extracts on a scries of stan- 
dard chromatography columns (see pp. 166-169). Once isolated, the pro- 
teins were shown to bind to specific DNA sequences close to the genes that they 
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Figure 9-3 Double-heUcal su^cturc 
of DNA. The major and minor grooves 
on the outside of the double helix. 
indicated. The atoms are colored as 
follows: carbon, dark blue; nitrogeHr 
light blue; hydrogen, white; oxygen* 
red; phosphorus, yellow. 
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Figure 9-71 A mechanism to ei^lain 
both the marked deficiemy of CG 
sequences and the presence of CG 
islands in vertebrate genomes. A 
black linemsaks the location of an 
umnethyiated CG dinudeotide in the 
DNA sequence, while a red line marks 
the location of a methylated CG 
dinudeotide. 
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the many types of cells in animals and plants are created largely through mecha- 
nisms that cause different genes to be transcribed in different cells. Since many spe- 
daHzed animal cells can maintain their unique character when grown in culture^ the 
gene regulatory mechanisms involved in creating them must be stable once estab- 
lished and heritable when the cell divides, endowing the cell with a memory of its 
developmental history. Procaryotes and yeasts provide unusually accessible model 
systems in which to study gene regulatory mechanisms, some of which may be rel- 
evant to the creation of specialized cell types in higher eucaryotes. One such mecha- 
nism involves a competitive interaction between two (or more) gene regulatory pro- 
teinSf each of which inhibits the synthesis of the other; this can create afUp-flop 
switch that switches a cell between two alternative patterns of gene expression. Di- 
rect or indirect positive feedback loops, which ermble gene regulatory proteins to 
perpetuate their own synthesis, provide a general mechanism for cell memory. 

In eucaryotes gene transcription is generally controlled by combinations of gene 
rectory proteins. It is thought that each type of cell in a higher eucaryotic organism 
contains a specific combination of gene regulatory proteins that ensures the expres- 
sion of only those genes appropriate to that type of cell A given gene regulatory pro- 
tein may be expressed in a variety of circumstances and typically is involved in the 
reflation of many genes. 

In addition to diffusible gene regulatory proteins, inherited states of chromatin 
condensation are also utUized by eucaryotic cells to regulate gene expression In ver- 
tebrates DNA methylation also plays a part, mainly as a device to reinforce decisions 
oboutgene expression that are made initiaUy by other mechanisms. 



Posttranscriptional Controls 

* • 

Although controls on the initiation of gene transcription are the predominant 
^nn of regulation for most genes, other controls can act later in the pathway 
om RNA to protein to modulate the amount of gene product that is made. Al- 
^ ough these posttranscriptionai controls, which operate after RNA polymerase 
^ bound to the gene's promoter and begun RNA synthesis, are less common 

transcriptional control, fox many genes they are crudal. It seems that every 
lated"^ expression that could be controUed in prindple is likely to be regu- 

under some circumstances for some genes, 
der ^e varieties of posttranscriptionai regulation in temporal or- 

moie^^T^^'^^ the sequence of events that might be experienced by an RNA 

cule after its transcription has begun (Figure 9-72). 
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expression. Only a few of these 
controls are likely to be used for any 
one gene. 



scriptional Controls 



453 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 



Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 



\JI BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 

□ FADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 



□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 



IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



BEST AVAILABLE IMAGES 





LINES OR MARKS ON ORIGINAL DOCUMENT 



